ADAPTIVE FINITE ELEMENT METHOD FOR ELLIPTIC OPTIMAL
CONTROL PROBLEMS: CONVERGENCE AND OPTIMALITY

WEI GONG AND NINGNING YAN


Abstract: In this paper we consider the convergence analysis of the adaptive finite element
method for elliptic optimal control problems with pointwise control constraints. We use the
variational discretization concept to discretize the control variable and piecewise linear,
continuous finite elements to approximate the state variable. Based on the well-established
convergence theory of AFEM for elliptic boundary value problems, we rigorously prove the
convergence and quasi-optimality of AFEM for optimal control problems with respect to the state
and adjoint state variables by using a so-called perturbation argument. Numerical experiments
confirm our theoretical analysis.
1. Introduction

The adaptive finite element method (AFEM for short), which goes back to the pioneering work of
Babuška and Rheinboldt ( 0 ), is nowadays a popular approach in the engineering and scientific
computing communities. It aims to distribute more mesh nodes around the areas where
singularities occur in order to save computational cost. Various types of reliable and efficient
a posteriori error estimators, which are used to detect the location of singularities and are
essential for the success of AFEM, have been developed over the last decades for different kinds
of problems; we refer to [36] for an overview.

Although AFEM has been applied successfully for more than three decades, its convergence
analysis is rather recent; it started with Dörfler [13] and was further studied in [6l|3l 1331 SUIT].
Besides convergence, optimality is another important issue in AFEM, which was first addressed
by Binev et al. [6] and further studied by Stevenson (1311 [35]). Dörfler's marking, proposed
in [13], and the quasi-error introduced in [7], which consists of the sum of the energy error and
the scaled estimator, are crucial for proving the contraction of the errors and the quasi-optimal
cardinality of the standard AFEM, which avoids marking for oscillation ([13]) and circumvents the
interior node property of mesh refinement ([32, 33]).

AFEM has also been applied successfully to optimal control problems governed by partial
differential equations, starting with Liu and Yan [26] and Becker, Kapp and Rannacher [3]. In [3]
the authors proposed a dual-weighted goal-oriented adaptivity for optimal control problems, while
in [26] residual type a posteriori error estimates were derived. We refer to
umiBiiiiiniiinEiEniinii] for more details on recent advances. Recently, Kohls, Rösch and Siebert
derived in [22] an error equivalence property which enables one to derive reliable and efficient
a posteriori error estimators for optimal control problems with either variational
discretization or full control discretization.

There have also been some attempts to prove the convergence of AFEM for optimal control problems.
In [14] the authors considered the piecewise constant approximation of the control variable and
gave an error reduction property for the quadruplet $(u, y, p, \sigma)$, where $u$, $y$, $p$ denote
the optimal control, state and adjoint state variables and $\sigma$ the associated co-control
variable. However, additional requirements on the strict complementarity of the continuous problem
and a non-degeneracy property of the discrete control problem are assumed, and the marking strategy
is extended to include the discrete free boundary between the active and inactive control sets.
In [4] the authors viewed the control problem as a nonlinear elliptic system for the state and
adjoint variables and gave a convergence proof for an adaptive algorithm that involves the marking
of data oscillation. In [23] the authors proved that the sequence of adaptively generated discrete
solutions converges to the true solution of the optimal control problem, but obtained only plain
convergence of the adaptive algorithm, without convergence rate or optimality. In this paper we
intend to give a rigorous convergence proof for the adaptive finite element algorithm for elliptic
optimal control problems in an optimal control framework. We stress that the AFEM adopted in the
current paper uses Dörfler's marking [13] and is a standard algorithm in that it employs only the
error indicators and does not use the oscillation indicators.

Inspired by the work m of Dai, Xu and Zhou, where the convergence and optimality of AFEM for
elliptic eigenvalue problems are proved by exploiting a certain relationship between the finite
element eigenvalue approximation and the associated finite element boundary value approximation,
in this paper we provide a rigorous convergence analysis of the adaptive finite element algorithm
for optimal control problems governed by linear elliptic equations. Under a mild assumption on the
initial mesh from which the adaptive algorithm starts, we show that the energy norm errors of the
state and adjoint state variables are equivalent to the errors of the boundary value approximations
of the state and adjoint state equations up to a higher order term. Then, based on the well-known
convergence result of AFEM for elliptic boundary value problems, we are able to prove the
convergence of AFEM for optimal control problems (OCPs for short). To be more specific, the AFEM
for OCPs is a contraction, between two consecutive adaptive loops, for the sum of the energy errors
and the scaled error estimators of the state $y$ and the adjoint state $p$. We also show that the
AFEM yields a decay rate, in terms of the number of degrees of freedom, for the energy errors of
the state $y$ and the adjoint state $p$ plus the oscillations of the state and adjoint state
equations. This result is an improvement over the plain convergence result presented in [23].

The rest of the paper is organised as follows. In Section 2 we recall some well-known results
on the convergence analysis of AFEM for elliptic boundary value problems. In Section 3 we
introduce the finite element approximation of the optimal control problem and derive a posteriori
error estimates. The adaptive finite element algorithm for the optimal control problem based on
Dörfler's marking is also presented. In Section 4 we give a rigorous convergence analysis of the
AFEM for optimal control problems, and the quasi-optimal cardinality is proved in Section 5.
Numerical experiments are carried out in Section 6 to validate our theoretical results. Finally,
we give a conclusion in Section 7 together with an outlook on possible extensions and future work.

Let $\Omega \subset \mathbb{R}^d$ ($d = 2, 3$) be a bounded polygonal or polyhedral domain. We
denote by $W^{m,q}(\Omega)$ the usual Sobolev space of order $m \ge 0$, $1 \le q < \infty$, with
norm $\|\cdot\|_{m,q,\Omega}$ and seminorm $|\cdot|_{m,q,\Omega}$. For $q = 2$ we denote
$W^{m,2}(\Omega)$ by $H^m(\Omega)$, which is a Hilbert space, and
$\|\cdot\|_{m,\Omega} = \|\cdot\|_{m,2,\Omega}$. Note that $H^0(\Omega) = L^2(\Omega)$ and
$H_0^1(\Omega) = \{v \in H^1(\Omega) : v = 0 \text{ on } \partial\Omega\}$. We denote by $C$ a
generic positive constant which may take different values at different occurrences but does not
depend on the mesh size. We use the symbol $A \lesssim B$ to denote $A \le CB$ for some constant
$C$ that is independent of the mesh size.


2. Preliminaries 

In this section, we recall some well-known results on the adaptive finite element approximation 
to a linear elliptic boundary value problem, which are then used for the convergence analysis of 
AFEM for optimal control problems. Some of the results are collected from and see also 

m- 
Consider the following second order elliptic equation

\[
\begin{cases}
Ly = f & \text{in } \Omega, \\
y = 0 & \text{on } \partial\Omega,
\end{cases}
\tag{2.1}
\]

where $L$ is a linear second order elliptic operator of the following form:

\[
Ly := -\nabla\cdot(A\nabla y) + cy.
\]

We denote by $L^*$ the adjoint operator of $L$:

\[
L^*y := -\nabla\cdot(A^*\nabla y) + cy.
\]

Here $(a_{ij})$ ($i,j = 1,\dots,d$) is symmetric and positive definite, and $0 < c < \infty$. We
denote $A = (a_{ij})_{d\times d}$ and by $A^*$ its adjoint. Let

\[
a(y,v) = \int_\Omega \Big( \sum_{i,j=1}^d a_{ij}\frac{\partial y}{\partial x_i}\frac{\partial v}{\partial x_j} + cyv \Big)\,dx,
\qquad \forall\, y, v \in H_0^1(\Omega).
\]

It is clear that $a(\cdot,\cdot)$ is a bounded bilinear form over $H_0^1(\Omega)$ and defines a
norm $\|\cdot\|_{a,\Omega} = \sqrt{a(\cdot,\cdot)}$ which is equivalent to $\|\cdot\|_{1,\Omega}$.

The standard weak form of (2.1) reads as follows: Find $y \in H_0^1(\Omega)$ such that

\[
a(y,v) = (f,v) \qquad \forall\, v \in H_0^1(\Omega).
\tag{2.2}
\]

For each $f \in L^2(\Omega)$ the above problem admits a unique solution by the well-known
Lax-Milgram theorem. Since the elliptic equation (2.2) is linear with respect to the right-hand
side $f$, we can define a linear solution operator $S : L^2(\Omega) \to H_0^1(\Omega)$ such that
$y = Sf$.

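Let $\mathcal{T}_h$ be a regular triangulation of $\Omega$ such that
$\bar\Omega = \bigcup_{T \in \mathcal{T}_h} \bar T$. We assume that $\mathcal{T}_h$ is shape
regular in the sense that there exists a constant $\gamma^* > 0$ such that
$h_T/\rho_T \le \gamma^*$ for all $T \in \mathcal{T}_h$, where $h_T$ denotes the diameter of $T$
and $\rho_T$ is the diameter of the biggest ball contained in $T$. We set
$h = \max_{T \in \mathcal{T}_h} h_T$. In this paper, we use $\mathcal{E}_h$ to denote the set of
interior faces (edges or sides) of $\mathcal{T}_h$ and $\#\mathcal{T}_h$ to denote the number of
elements of $\mathcal{T}_h$.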

On $\mathcal{T}_h$ we construct a family of nested finite element spaces $V_h$ consisting of
piecewise linear and continuous polynomials such that $V_h \subset C(\bar\Omega) \cap H_0^1(\Omega)$.
We define the standard Galerkin projection operator $\mathcal{R}_h : H_0^1(\Omega) \to V_h$ by ([5])

\[
a(y - \mathcal{R}_h y, v_h) = 0 \qquad \forall\, v_h \in V_h,
\tag{2.3}
\]

which satisfies the following stability result

\[
\|\mathcal{R}_h y\|_{a,\Omega} \le \|y\|_{a,\Omega} \qquad \forall\, y \in H_0^1(\Omega).
\tag{2.4}
\]

A standard finite element approximation to (2.2) can then be formulated as: Find $y_h \in V_h$
such that

\[
a(y_h, v_h) = (f, v_h) \qquad \forall\, v_h \in V_h.
\tag{2.5}
\]

Similarly, we can define a discrete solution operator $S_h : L^2(\Omega) \to V_h$ such that
$y_h = S_h f$. Thus, we have $y_h = \mathcal{R}_h y = \mathcal{R}_h S f$.

For later use, we follow the idea of m and introduce the quantity $\kappa(h)$ as follows:

\[
\kappa(h) = \sup_{f \in L^2(\Omega),\, \|f\|_{0,\Omega} = 1}\ \inf_{v_h \in V_h} \|Sf - v_h\|_{a,\Omega}.
\tag{2.6}
\]

We note that the quantity $\kappa(h)$ is determined by the regularity of $Sf$, which is in turn
influenced by the properties of the domain $\Omega$. Indeed, if the boundary of $\Omega$ is
sufficiently smooth, the additional regularity $Sf \in H^2(\Omega)$ holds and thus
$\kappa(h) = O(h)$. This is still true for polygonal or polyhedral boundaries if the domain is
convex. The regularity is reduced, however, in the vicinity of nonconvex portions of polygonal or
polyhedral boundaries. Grisvard proved in m the precise regularity results (Theorem 2.4.3 for the
two-dimensional case and Corollary 2.6.7 for the three-dimensional case): there exists an
$\varepsilon \in (0, \tfrac12]$, which depends on the shape of the domain, such that
$Sf \in H^{3/2+\varepsilon}(\Omega)$ for each $f \in L^2(\Omega)$. Obviously, $\kappa(h) \ll 1$ for
$h \in (0, h_0)$ if $h_0 \ll 1$.

The following results are standard and can be found in, e.g., mm 

Proposition 2.1. For each $f \in L^2(\Omega)$, there hold

\[
\|Sf - S_h f\|_{a,\Omega} \lesssim \kappa(h)\,\|f\|_{0,\Omega}
\tag{2.7}
\]

and

\[
\|Sf - S_h f\|_{0,\Omega} \lesssim \kappa(h)\,\|Sf - S_h f\|_{a,\Omega}.
\tag{2.8}
\]

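We are now in a position to review the residual type a posteriori error estimator for the finite
element approximation of elliptic boundary value problems. We define the element residual
$r_T(y_h)$ and the jump residual $j_E(y_h)$ by

\[
r_T(y_h) := f - Ly_h = f + \nabla\cdot(A\nabla y_h) - cy_h \quad \text{in } T \in \mathcal{T}_h,
\tag{2.9}
\]

\[
j_E(y_h) := [A\nabla y_h]_E \cdot n_E \quad \text{on } E \in \mathcal{E}_h,
\tag{2.10}
\]

where $[A\nabla y_h]_E \cdot n_E$ denotes the jump of $A\nabla y_h$ across the common side $E$ of
elements $T^+$ and $T^-$, and $n_E$ denotes the outward normal oriented to $T^-$. For each element
$T \in \mathcal{T}_h$, we define the local error indicator $\tilde\eta_h(y_h,T)$ by

\[
\tilde\eta_h(y_h,T) := \Big( h_T^2\|r_T(y_h)\|_{0,T}^2 + \sum_{E \in \mathcal{E}_h,\, E \subset \partial T} h_E\|j_E(y_h)\|_{0,E}^2 \Big)^{1/2}.
\tag{2.11}
\]

Then on a subset $\omega \subset \Omega$, we define the error estimator $\tilde\eta_h(y_h,\omega)$ by

\[
\tilde\eta_h(y_h,\omega) := \Big( \sum_{T \in \mathcal{T}_h,\, T \subset \omega} \tilde\eta_h^2(y_h,T) \Big)^{1/2}.
\tag{2.12}
\]

Thus, $\tilde\eta_h(y_h,\Omega)$ constitutes the error estimator on $\Omega$ with respect to
$\mathcal{T}_h$.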
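For $f \in L^2(\Omega)$ we also need to define the data oscillation as (see [32, 33])

\[
\mathrm{osc}(f,T) := \|h_T(f - \bar f_T)\|_{0,T}, \qquad
\mathrm{osc}(f,\mathcal{T}_h) := \Big( \sum_{T \in \mathcal{T}_h} \mathrm{osc}^2(f,T) \Big)^{1/2},
\tag{2.13}
\]

where $\bar f_T$ denotes the $L^2$-projection of $f$ onto the space of constants on $T$. It is
easy to see that

\[
\mathrm{osc}(f_1 + f_2, \mathcal{T}_h) \le \mathrm{osc}(f_1,\mathcal{T}_h) + \mathrm{osc}(f_2,\mathcal{T}_h), \qquad \forall\, f_1, f_2 \in L^2(\Omega).
\tag{2.14}
\]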
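The definition (2.13) and the subadditivity (2.14) can be illustrated on a one-dimensional mesh. The following Python sketch is only an illustration under stated assumptions: the interval mesh, the two-point Gauss quadrature, and the function names `osc_local` and `osc_global` are hypothetical choices and are not part of the paper.

```python
import math

# Two-point Gauss-Legendre rule on [-1, 1]; exact for polynomials of degree <= 3.
_GAUSS = ((-1.0 / math.sqrt(3.0), 1.0), (1.0 / math.sqrt(3.0), 1.0))


def _integrate(g, a, b):
    """Approximate the integral of g over [a, b] with the two-point Gauss rule."""
    mid, half = 0.5 * (a + b), 0.5 * (b - a)
    return half * sum(w * g(mid + half * x) for x, w in _GAUSS)


def osc_local(f, a, b):
    """osc(f, T) = || h_T (f - f_T) ||_{0,T} on the interval T = [a, b]."""
    h = b - a
    f_mean = _integrate(f, a, b) / h  # L2-projection of f onto constants on T
    return h * math.sqrt(_integrate(lambda x: (f(x) - f_mean) ** 2, a, b))


def osc_global(f, nodes):
    """osc(f, T_h) = (sum_T osc(f, T)^2)^(1/2) for the mesh given by sorted nodes."""
    return math.sqrt(sum(osc_local(f, a, b) ** 2 for a, b in zip(nodes, nodes[1:])))
```

For a constant function the oscillation vanishes, and for $f(x) = x$ on $[0, h]$ the quadrature is exact, giving $h^{5/2}/\sqrt{12}$.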

For the above defined data oscillation we have the following lemma whose proof can be found in 
m Lemma 2.4]. 

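Lemma 2.2. There exists a constant $C_*$, which depends on $A$, the mesh regularity constant
$\gamma^*$ and the coefficient $c$, such that

\[
\mathrm{osc}(Lv,\mathcal{T}_h) \le C_*\|v\|_{a,\Omega}, \qquad
\mathrm{osc}(L^*v,\mathcal{T}_h) \le C_*\|v\|_{a,\Omega} \qquad \forall\, v \in V_h.
\tag{2.15}
\]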

Now we can formulate the following global upper and lower bounds for the a posteriori error 
estimators of elliptic boundary value problems (see, e.g., [TOl [55] 1: 

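\[
\|y - y_h\|_{a,\Omega}^2 \le \tilde C_1\,\tilde\eta_h^2(y_h,\Omega),
\tag{2.16}
\]

\[
\tilde C_2\,\tilde\eta_h^2(y_h,\Omega) \le \|y - y_h\|_{a,\Omega}^2 + \tilde C_3\,\mathrm{osc}^2(f - Ly_h,\mathcal{T}_h).
\tag{2.17}
\]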

For later use we also need to study the adjoint equation of the elliptic boundary value problem
(2.1). For each $g \in L^2(\Omega)$, let $p \in H_0^1(\Omega)$ be the solution of the following
adjoint equation

\[
a(v,p) = (g,v) \qquad \forall\, v \in H_0^1(\Omega)
\tag{2.18}
\]

with its finite element approximation

\[
a(v_h,p_h) = (g,v_h) \qquad \forall\, v_h \in V_h.
\tag{2.19}
\]

We can also give the a posteriori global upper and lower error bounds:

\[
\|p - p_h\|_{a,\Omega}^2 \le \tilde C_1\,\tilde\eta_h^2(p_h,\Omega),
\tag{2.20}
\]

\[
\tilde C_2\,\tilde\eta_h^2(p_h,\Omega) \le \|p - p_h\|_{a,\Omega}^2 + \tilde C_3\,\mathrm{osc}^2(g - L^*p_h,\mathcal{T}_h).
\tag{2.21}
\]

To analyse the adaptive finite element approximation of the optimal control problem, we introduce
a system of two source problems associated with the state and adjoint state equations, which is a
straightforward extension of the existing results on the adaptive finite element approximation of
scalar problems (see H). Specifically, we introduce the adaptive finite element algorithm to solve
the system of elliptic boundary value problems (2.2) and (2.18). There are different kinds of
adaptive algorithms, which differ in their marking strategies (see [31, 32, 33]). Here we follow
Dörfler's marking introduced in [13], which marks only by the error estimator and avoids marking
for oscillation:

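Algorithm 2.3. Dörfler's marking strategy for BVPs

(1) Given a parameter $0 < \theta < 1$;

(2) Construct a minimal subset $\tilde{\mathcal{T}}_h \subset \mathcal{T}_h$ such that

\[
\sum_{T \in \tilde{\mathcal{T}}_h} \big( \tilde\eta_h^2(y_h,T) + \tilde\eta_h^2(p_h,T) \big) \ge \theta \big( \tilde\eta_h^2(y_h,\Omega) + \tilde\eta_h^2(p_h,\Omega) \big).
\]

(3) Mark all the elements in $\tilde{\mathcal{T}}_h$.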
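The marking step of Algorithm 2.3 can be sketched in Python as follows. This is a minimal sketch and not the authors' implementation: the function name `dorfler_mark` and the representation of the squared indicators as a plain list are assumptions made for illustration. Sorting the elements by decreasing indicator and taking the shortest prefix that reaches the fraction $\theta$ of the total yields a subset of minimal cardinality.

```python
def dorfler_mark(eta_sq, theta):
    """Return indices of a minimal set M with sum_{T in M} eta_sq[T] >= theta * sum(eta_sq).

    eta_sq : list of squared local indicators, one per element
             (for Algorithm 2.3 this would be eta^2(y_h, T) + eta^2(p_h, T)).
    theta  : marking parameter with 0 < theta < 1.
    """
    if not 0.0 < theta < 1.0:
        raise ValueError("theta must lie in (0, 1)")
    target = theta * sum(eta_sq)
    # Largest indicators first: the shortest prefix reaching the target is minimal.
    order = sorted(range(len(eta_sq)), key=lambda i: eta_sq[i], reverse=True)
    marked, acc = [], 0.0
    for i in order:
        if acc >= target:
            break
        marked.append(i)
        acc += eta_sq[i]
    return marked


# Hypothetical squared indicators on a mesh with five elements.
marked = dorfler_mark([0.10, 0.40, 0.05, 0.30, 0.15], theta=0.5)
```

With these hypothetical values the two largest indicators (elements 1 and 3) already carry more than half of the total, so only they are marked.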

The adaptive algorithm for solving elliptic boundary value problems can then be described as
follows (see [7]):

Algorithm 2.4. Adaptive finite element method for BVPs:

(1) Given an initial mesh $\mathcal{T}_{h_0}$ with mesh size $h_0$, construct the finite element space $V_{h_0}$.

(2) Set $k = 0$, solve (2.5) and (2.19) to obtain $(y_{h_k}, p_{h_k}) \in V_{h_k} \times V_{h_k}$.

(3) Compute the local error indicators $\tilde\eta_{h_k}(y_{h_k},T)$ and $\tilde\eta_{h_k}(p_{h_k},T)$ for each $T \in \mathcal{T}_{h_k}$.

(4) Construct $\tilde{\mathcal{T}}_{h_k} \subset \mathcal{T}_{h_k}$ by the marking Algorithm 2.3.

(5) Refine $\tilde{\mathcal{T}}_{h_k}$ to get a new conforming mesh $\mathcal{T}_{h_{k+1}}$.

(6) Construct the finite element space $V_{h_{k+1}}$, solve (2.5) and (2.19) to obtain $(y_{h_{k+1}}, p_{h_{k+1}}) \in V_{h_{k+1}} \times V_{h_{k+1}}$.

(7) Set $k = k + 1$ and go to Step (3).
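The structure of the loop in Algorithm 2.4 can be sketched for a one-dimensional model problem. The Python code below is a simplified illustration under several assumptions that are not taken from the paper: it treats only the single equation $-y'' = f$ on an interval with piecewise linear elements, uses the midpoint rule for all integrals, keeps only the element residual $h_T\|f\|_{0,T}$ in the indicator (the jump terms are dropped), and bisects each marked interval once. All function names are hypothetical.

```python
def solve_p1(nodes, f):
    """SOLVE: P1 Galerkin solution of -y'' = f with zero boundary values."""
    n = len(nodes) - 1
    h = [nodes[i + 1] - nodes[i] for i in range(n)]
    m = n - 1  # number of interior nodes
    if m == 0:
        return [0.0, 0.0]
    diag = [1.0 / h[i] + 1.0 / h[i + 1] for i in range(m)]
    off = [-1.0 / h[i + 1] for i in range(m - 1)]
    rhs = [0.0] * m
    for e in range(n):  # midpoint rule for the load vector
        val = 0.5 * h[e] * f(0.5 * (nodes[e] + nodes[e + 1]))
        if e >= 1:
            rhs[e - 1] += val
        if e <= n - 2:
            rhs[e] += val
    for i in range(1, m):  # Thomas algorithm: forward elimination
        w = off[i - 1] / diag[i - 1]
        diag[i] -= w * off[i - 1]
        rhs[i] -= w * rhs[i - 1]
    y = [0.0] * m
    y[-1] = rhs[-1] / diag[-1]
    for i in range(m - 2, -1, -1):  # back substitution
        y[i] = (rhs[i] - off[i] * y[i + 1]) / diag[i]
    return [0.0] + y + [0.0]


def estimate(nodes, f):
    """ESTIMATE: squared indicators h_T^2 * ||f||_{0,T}^2 via the midpoint rule."""
    return [(b - a) ** 3 * f(0.5 * (a + b)) ** 2 for a, b in zip(nodes, nodes[1:])]


def mark(eta_sq, theta):
    """MARK: Doerfler marking, greedily taking the largest indicators."""
    order = sorted(range(len(eta_sq)), key=lambda i: eta_sq[i], reverse=True)
    marked, acc, target = set(), 0.0, theta * sum(eta_sq)
    for i in order:
        if acc >= target:
            break
        marked.add(i)
        acc += eta_sq[i]
    return marked


def refine(nodes, marked):
    """REFINE: bisect every marked interval once."""
    new_nodes = [nodes[0]]
    for e, (a, b) in enumerate(zip(nodes, nodes[1:])):
        if e in marked:
            new_nodes.append(0.5 * (a + b))
        new_nodes.append(b)
    return new_nodes


def afem(f, nodes, theta=0.5, loops=8):
    """Run SOLVE -> ESTIMATE -> MARK -> REFINE and record the squared estimator."""
    history = []
    for _ in range(loops):
        solve_p1(nodes, f)  # the 1D indicator below does not use y_h
        eta_sq = estimate(nodes, f)
        history.append(sum(eta_sq))
        nodes = refine(nodes, mark(eta_sq, theta))
    return nodes, solve_p1(nodes, f), history
```

In this simplified setting the indicator depends only on the data $f$, so the solve inside the loop is kept only to mirror the structure of the algorithm; for the system (2.5) and (2.19) the indicators depend on the discrete solutions.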

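We denote by $\mathbb{T}$ the class of all conforming refinements by bisection of
$\mathcal{T}_{h_0}$ (see [7] for more details). Given a fixed number $b \ge 1$, for any
$\mathcal{T}_{h_k} \in \mathbb{T}$ and a subset $\mathcal{M}_{h_k} \subset \mathcal{T}_{h_k}$ of
marked elements,

\[
\mathcal{T}_{h_{k+1}} = \mathrm{REFINE}(\mathcal{T}_{h_k}, \mathcal{M}_{h_k})
\]

outputs a conforming triangulation $\mathcal{T}_{h_{k+1}} \in \mathbb{T}$, where at least all
elements of $\mathcal{M}_{h_k}$ are bisected $b$ times. We define
$\mathcal{R}_{\mathcal{T}_{h_k}\to\mathcal{T}_{h_{k+1}}} := \mathcal{T}_{h_k} \setminus (\mathcal{T}_{h_k} \cap \mathcal{T}_{h_{k+1}})$,
the set of refined elements, which satisfies
$\mathcal{M}_{h_k} \subset \mathcal{R}_{\mathcal{T}_{h_k}\to\mathcal{T}_{h_{k+1}}}$.

Then we can formulate the following standard result on the complexity of refinement; see
[7, Lemma 2.3] and [35] for more details.

Lemma 2.5. Assume that $\mathcal{T}_{h_0}$ verifies condition (b) of Section 4 in [35]. Let
$\{\mathcal{T}_{h_k}\}_{k \ge 0}$ be a sequence of conforming and nested triangulations of $\Omega$
generated by REFINE starting from the initial mesh $\mathcal{T}_{h_0}$. Assume that
$\mathcal{T}_{h_{k+1}}$ is generated from $\mathcal{T}_{h_k}$ by
$\mathcal{T}_{h_{k+1}} = \mathrm{REFINE}(\mathcal{T}_{h_k}, \mathcal{M}_{h_k})$ with a subset
$\mathcal{M}_{h_k} \subset \mathcal{T}_{h_k}$. Then there exists a constant $\hat C_0$ depending on
$\mathcal{T}_{h_0}$ and $b$ such that

\[
\#\mathcal{T}_{h_{k+1}} - \#\mathcal{T}_{h_0} \le \hat C_0 \sum_{i=0}^{k} \#\mathcal{M}_{h_i} \qquad \forall\, k \ge 1.
\tag{2.22}
\]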

We define

\[
\|(y,p)\|_a^2 = a(y,y) + a(p,p).
\]

The convergence of Algorithm 2.4 based on the marking Algorithm 2.3 is proven in [7] and has
become a standard theory for the convergence analysis of AFEM for different kinds of boundary
value problems. The following Theorem 2.6, Lemma 2.7 and Lemma 2.8 are extensions of the
corresponding results for a single elliptic equation in [7] by some elementary operations. We
remark that in [10] the authors used a similar idea to prove the convergence of adaptive finite
element computations for multiple eigenvalues.


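Theorem 2.6. Let $(y_{h_k}, p_{h_k}) \in V_{h_k} \times V_{h_k}$ be a sequence of finite element
solutions of problems (2.2) and (2.18) based on the adaptively refined meshes $\mathcal{T}_{h_k}$
produced by Algorithm 2.4. Then there exist constants $\gamma > 0$ and $\beta \in (0,1)$,
depending only on the shape regularity of the meshes, the data and the parameters used in
Algorithm 2.4, such that for any two consecutive iterates $k$ and $k+1$ we have

\[
\begin{aligned}
&\|(y - y_{h_{k+1}}, p - p_{h_{k+1}})\|_a^2 + \gamma\big( \tilde\eta_{h_{k+1}}^2(y_{h_{k+1}},\Omega) + \tilde\eta_{h_{k+1}}^2(p_{h_{k+1}},\Omega) \big) \\
&\qquad \le \beta^2 \Big( \|(y - y_{h_k}, p - p_{h_k})\|_a^2 + \gamma\big( \tilde\eta_{h_k}^2(y_{h_k},\Omega) + \tilde\eta_{h_k}^2(p_{h_k},\Omega) \big) \Big).
\end{aligned}
\tag{2.23}
\]

Here

(2.24)

with some constant $\delta \in (0,1)$.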


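To prove the optimal complexity of the adaptive algorithm we need further results. The following
lemma presents a localised upper bound for the distance between two nested solutions of the
elliptic boundary value problems (2.2) and (2.18) (see [7, Lemma 3.6] and [11, Lemma 6.2]).

Lemma 2.7. Let $(y_{h_k}, p_{h_k}) \in V_{h_k} \times V_{h_k}$ and
$(y_{h_{k+1}}, p_{h_{k+1}}) \in V_{h_{k+1}} \times V_{h_{k+1}}$ be the discrete solutions of
problems (2.2) and (2.18) over a mesh $\mathcal{T}_{h_k}$ and its refinement
$\mathcal{T}_{h_{k+1}}$ with marked elements $\mathcal{M}_{h_k} \subset \mathcal{T}_{h_k}$. Let
$\mathcal{R}_{\mathcal{T}_{h_k}\to\mathcal{T}_{h_{k+1}}}$ be the set of refined elements. Then the
following localised upper bound is valid:

\[
\|(y_{h_k} - y_{h_{k+1}}, p_{h_k} - p_{h_{k+1}})\|_a^2 \le \tilde C_1 \sum_{T \in \mathcal{R}_{\mathcal{T}_{h_k}\to\mathcal{T}_{h_{k+1}}}} \big( \tilde\eta_{h_k}^2(y_{h_k},T) + \tilde\eta_{h_k}^2(p_{h_k},T) \big).
\tag{2.25}
\]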

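Consequently, we can show the optimality of Dörfler's marking strategy in the following lemma
(see [7, Lemma 5.9] and [11, Proposition 6.3] for the proof).

Lemma 2.8. Let $(y_{h_k}, p_{h_k}) \in V_{h_k} \times V_{h_k}$ and
$(y_{h_{k+1}}, p_{h_{k+1}}) \in V_{h_{k+1}} \times V_{h_{k+1}}$ be the discrete solutions of
problems (2.2) and (2.18) over a mesh $\mathcal{T}_{h_k}$ and its refinement
$\mathcal{T}_{h_{k+1}}$ with marked elements $\mathcal{M}_{h_k} \subset \mathcal{T}_{h_k}$. Suppose
that they satisfy the energy decrease property

\[
\begin{aligned}
&\|(y - y_{h_{k+1}}, p - p_{h_{k+1}})\|_a^2 + \gamma_0\big( \mathrm{osc}^2(f - Ly_{h_{k+1}},\mathcal{T}_{h_{k+1}}) + \mathrm{osc}^2(g - L^*p_{h_{k+1}},\mathcal{T}_{h_{k+1}}) \big) \\
&\qquad \le \beta_0^2 \Big( \|(y - y_{h_k}, p - p_{h_k})\|_a^2 + \gamma_0\big( \mathrm{osc}^2(f - Ly_{h_k},\mathcal{T}_{h_k}) + \mathrm{osc}^2(g - L^*p_{h_k},\mathcal{T}_{h_k}) \big) \Big)
\end{aligned}
\tag{2.26}
\]

with $\gamma_0 > 0$ a constant and $\beta_0 \in (0, \sqrt{1/2})$. Then the set
$\mathcal{R}_{\mathcal{T}_{h_k}\to\mathcal{T}_{h_{k+1}}}$ of refined elements satisfies the Dörfler
property

\[
\sum_{T \in \mathcal{R}_{\mathcal{T}_{h_k}\to\mathcal{T}_{h_{k+1}}}} \big( \tilde\eta_{h_k}^2(y_{h_k},T) + \tilde\eta_{h_k}^2(p_{h_k},T) \big) \ge \tilde\theta \sum_{T \in \mathcal{T}_{h_k}} \big( \tilde\eta_{h_k}^2(y_{h_k},T) + \tilde\eta_{h_k}^2(p_{h_k},T) \big)
\tag{2.27}
\]

with

\[
\tilde\theta = \frac{\tilde C_2(1 - 2\beta_0^2)}{\tilde C_0\big( \tilde C_1 + (1 + 2C_*^2\tilde C_1)\gamma_0 \big)},
\]

where $\tilde C_0 = \max(1, \tilde C_3/\gamma_0)$.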
3. Adaptive finite element method for optimal control problem

In this section we consider the following elliptic optimal control problem:

\[
\min_{u \in U_{ad}} J(y,u) = \frac12\|y - y_d\|_{0,\Omega}^2 + \frac{\alpha}{2}\|u\|_{0,\Omega}^2
\tag{3.1}
\]

subject to

\[
\begin{cases}
Ly = u & \text{in } \Omega, \\
y = 0 & \text{on } \partial\Omega,
\end{cases}
\tag{3.2}
\]

where $\alpha > 0$ is a fixed parameter and $U_{ad}$ is the admissible control set with bilateral
control constraints:

\[
U_{ad} := \{ u \in L^2(\Omega) : a \le u \le b \ \text{a.e. in } \Omega \},
\]

where $a, b \in \mathbb{R}$ and $a < b$.


Remark 3.1. We remark that all the theory presented below can be generalised to the case where
the control acts on a subdomain $\omega \subset \Omega$. In this case the governing equation reads
$Ly = Bu$ with the control operator $B : L^2(\omega) \to L^2(\Omega)$ being the extension-by-zero
operator from $\omega$ to $\Omega$.

With the solution operator $S$ of the elliptic equation (3.2) introduced in the last section, we
can formulate a reduced optimization problem

\[
\min_{u \in U_{ad}} \hat J(u) := J(Su,u) = \frac12\|Su - y_d\|_{0,\Omega}^2 + \frac{\alpha}{2}\|u\|_{0,\Omega}^2.
\tag{3.3}
\]

Since the above optimization problem is linear-quadratic and strictly convex, there exists a
unique solution $u \in U_{ad}$ by a standard argument (see [IS]). Moreover, the first order
necessary and sufficient optimality condition can be stated as follows:

\[
\hat J'(u)(v - u) = (\alpha u + S^*(Su - y_d), v - u) \ge 0, \qquad \forall\, v \in U_{ad},
\tag{3.4}
\]

where $S^*$ is the adjoint of $S$ ([II]). Introducing the adjoint state
$p := S^*(Su - y_d) \in H_0^1(\Omega)$, we are led to the following optimality system:

\[
\begin{cases}
a(y,v) = (u,v), & \forall\, v \in H_0^1(\Omega), \\
a(w,p) = (y - y_d,w), & \forall\, w \in H_0^1(\Omega), \\
(\alpha u + p, v - u) \ge 0, & \forall\, v \in U_{ad}.
\end{cases}
\tag{3.5}
\]

Hereafter, we call $u$, $y$ and $p$ the optimal control, state and adjoint state, respectively.
From the last inequality of (3.5) we have the pointwise representation of $u$ (see pSjl:

\[
u(x) = P_{[a,b]}\Big\{ -\frac{1}{\alpha}p(x) \Big\},
\tag{3.6}
\]

where $P_{[a,b]}$ is the orthogonal projection operator from $L^2(\Omega)$ to $U_{ad}$.
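The pointwise formula (3.6) amounts to clipping $-p(x)/\alpha$ to the interval $[a, b]$. A minimal Python sketch follows; the function names and the sample values of the adjoint state are hypothetical and serve only as an illustration.

```python
def project(value, a, b):
    """Pointwise projection P_[a,b] of a real number onto the interval [a, b]."""
    return min(b, max(a, value))


def control_from_adjoint(p_values, alpha, a, b):
    """Evaluate u(x) = P_[a,b](-p(x) / alpha) at the sample points of p."""
    if alpha <= 0:
        raise ValueError("alpha must be positive")
    return [project(-p / alpha, a, b) for p in p_values]


# Hypothetical adjoint values at four sample points, alpha = 0.1, bounds [-1, 1].
u = control_from_adjoint([0.30, 0.05, -0.02, -0.50], alpha=0.1, a=-1.0, b=1.0)
```

In this example the first and last sample points are active (the control sits on the lower and upper bound, respectively), while the two middle points are inactive and satisfy $u = -p/\alpha$.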

Next, let us consider the finite element approximation of (3.1)-(3.2). In this paper, we use
piecewise linear finite elements to approximate the state $y$, and variational discretization for
the optimal control $u$ (see [10]). Based on the finite element space $V_h$, we can define the
finite dimensional approximation to the optimal control problem (3.1)-(3.2) as follows: Find
$(u_h, y_h) \in U_{ad} \times V_h$ such that

\[
\min_{u_h \in U_{ad}} J_h(y_h,u_h) = \frac12\|y_h - y_d\|_{0,\Omega}^2 + \frac{\alpha}{2}\|u_h\|_{0,\Omega}^2
\tag{3.7}
\]

subject to

\[
a(y_h,v_h) = (u_h,v_h), \qquad \forall\, v_h \in V_h.
\tag{3.8}
\]

Similar to the continuous case we have $y_h = S_h u_h$. With this notation we can formulate a
reduced discrete optimization problem

\[
\min_{u_h \in U_{ad}} \hat J_h(u_h) := J_h(S_h u_h,u_h) = \frac12\|S_h u_h - y_d\|_{0,\Omega}^2 + \frac{\alpha}{2}\|u_h\|_{0,\Omega}^2.
\tag{3.9}
\]

We note that the above optimization problem can be solved by the projected gradient method or the
semi-smooth Newton method; see HU, m and [3n] for more details.
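To illustrate the projected gradient method mentioned above, the following Python sketch minimises a small finite-dimensional analogue of the reduced problem (3.9). It is not the authors' solver: the dense matrix `S` is a hypothetical stand-in for the discrete solution operator $S_h$, the Euclidean inner product replaces the $L^2$ inner product, and the fixed step size $1/(\alpha + \|S\|_F^2)$ is an assumed, conservative choice.

```python
def matvec(M, x):
    """Product of a dense matrix (list of rows) with a vector."""
    return [sum(m_ij * x_j for m_ij, x_j in zip(row, x)) for row in M]


def transpose(M):
    """Transpose of a dense matrix given as a list of rows."""
    return [list(col) for col in zip(*M)]


def projected_gradient(S, y_d, alpha, a, b, tol=1e-12, max_iter=10000):
    """Minimise 0.5*|S u - y_d|^2 + 0.5*alpha*|u|^2 subject to a <= u_i <= b."""
    St = transpose(S)
    # Step 1/L, where L = alpha + ||S||_F^2 bounds the Lipschitz constant of the gradient.
    tau = 1.0 / (alpha + sum(s * s for row in S for s in row))
    u = [min(b, max(a, 0.0))] * len(S[0])
    for _ in range(max_iter):
        residual = [yi - di for yi, di in zip(matvec(S, u), y_d)]
        # Gradient alpha*u + S^T (S u - y_d); the second term plays the role of p_h.
        grad = [alpha * ui + gi for ui, gi in zip(u, matvec(St, residual))]
        u_new = [min(b, max(a, ui - tau * gi)) for ui, gi in zip(u, grad)]
        if max(abs(x - y) for x, y in zip(u_new, u)) < tol:
            return u_new
        u = u_new
    return u


# Hypothetical 2x2 example with bounds [-1, 1]; both constraints become active.
S = [[0.5, 0.25], [0.25, 0.5]]
u_opt = projected_gradient(S, y_d=[1.0, -1.0], alpha=0.1, a=-1.0, b=1.0)
```

At a fixed point of the iteration the control satisfies $u = P_{[a,b]}(u - \tau \nabla \hat J(u))$, which is equivalent to the variational inequality in the optimality condition.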

Similar to the continuous problem (3.1)-(3.2), the above discretized optimization problem also
admits a unique solution $u_h \in U_{ad}$. Moreover, the first order necessary and sufficient
optimality condition can be stated as follows:

\[
\hat J_h'(u_h)(v_h - u_h) = (\alpha u_h + S_h^*(S_h u_h - y_d), v_h - u_h) \ge 0, \qquad \forall\, v_h \in U_{ad},
\tag{3.10}
\]

where $S_h^*$ is the adjoint of $S_h$. Introducing the adjoint state
$p_h := S_h^*(S_h u_h - y_d) \in V_h$, the discretized first order necessary and sufficient
optimality condition is equivalent to:

\[
\begin{cases}
a(y_h,v_h) = (u_h,v_h), & \forall\, v_h \in V_h, \\
a(w_h,p_h) = (y_h - y_d,w_h), & \forall\, w_h \in V_h, \\
(\alpha u_h + p_h, v_h - u_h) \ge 0, & \forall\, v_h \in U_{ad}.
\end{cases}
\tag{3.11}
\]

Hereafter, we call $u_h$, $y_h$ and $p_h$ the discrete optimal control, state and adjoint state,
respectively. Similar to the continuous case (3.6) we have

\[
u_h(x) = P_{[a,b]}\Big\{ -\frac{1}{\alpha}p_h(x) \Big\}.
\tag{3.12}
\]

It should be noted that $u_h$ is in general not a finite element function in $V_h$.

For convenience we define $y^h := Su_h$ and $p^h := S^*(S_h u_h - y_d)$. It is obvious that $y_h$
and $p_h$ are the standard Galerkin projections of $y^h$ and $p^h$, i.e.,
$y_h = \mathcal{R}_h y^h$ and $p_h = \mathcal{R}_h p^h$. The following equivalence property is
established in [22].

Theorem 3.2. Let $(u,y,p) \in U_{ad} \times H_0^1(\Omega) \times H_0^1(\Omega)$ and
$(u_h,y_h,p_h) \in U_{ad} \times V_h \times V_h$ be the solutions of problems (3.1)-(3.2) and
(3.7)-(3.8), respectively. Then the following equivalence property holds:

\[
\|u - u_h\|_{0,\Omega} + \|y - y_h\|_{a,\Omega} + \|p - p_h\|_{a,\Omega} \approx \|y^h - y_h\|_{a,\Omega} + \|p^h - p_h\|_{a,\Omega}.
\tag{3.13}
\]

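Proof. For completeness we include a brief proof. Setting $v = u_h$ in (3.4) and $v_h = u$ in
(3.10) we are led to

\[
(\alpha u + S^*(Su - y_d), u_h - u) \ge 0,
\tag{3.14}
\]

\[
(\alpha u_h + S_h^*(S_h u_h - y_d), u - u_h) \ge 0.
\tag{3.15}
\]

Adding the above two inequalities, we conclude from (3.5) and (3.11) that

\[
\begin{aligned}
\alpha\|u - u_h\|_{0,\Omega}^2
&\le (S_h^*(S_h u_h - y_d) - S^*(Su - y_d), u - u_h) \\
&= (S_h^*(S_h u_h - y_d) - S^*(S_h u_h - y_d), u - u_h) + (S^*(S_h u_h - y_d) - S^*(Su - y_d), u - u_h) \\
&= (S_h^*(S_h u_h - y_d) - S^*(S_h u_h - y_d), u - u_h) + (S_h u_h - Su, Su - Su_h) \\
&= (S_h^*(S_h u_h - y_d) - S^*(S_h u_h - y_d), u - u_h) + (S_h u_h - Su, Su - S_h u_h) \\
&\quad + (S_h u_h - Su, S_h u_h - Su_h).
\end{aligned}
\tag{3.16}
\]

It follows from the $\varepsilon$-Young inequality that

\[
\alpha\|u - u_h\|_{0,\Omega}^2 \le C\|Su_h - S_h u_h\|_{0,\Omega}^2 + C\|S^*(S_h u_h - y_d) - S_h^*(S_h u_h - y_d)\|_{0,\Omega}^2.
\tag{3.17}
\]

Moreover, we have

\[
\begin{aligned}
\|y - y_h\|_{a,\Omega} &\le \|y - Su_h\|_{a,\Omega} + \|Su_h - y_h\|_{a,\Omega} \\
&\le C\|u - u_h\|_{0,\Omega} + \|Su_h - y_h\|_{a,\Omega}
\end{aligned}
\]

and

\[
\begin{aligned}
\|p - p_h\|_{a,\Omega} &\le \|p - S^*(S_h u_h - y_d)\|_{a,\Omega} + \|S^*(S_h u_h - y_d) - p_h\|_{a,\Omega} \\
&\le C\|Su - S_h u_h\|_{0,\Omega} + \|S^*(S_h u_h - y_d) - p_h\|_{a,\Omega} \\
&\le C\|u - u_h\|_{0,\Omega} + \|S^*(S_h u_h - y_d) - p_h\|_{a,\Omega} + C\|Su_h - y_h\|_{a,\Omega}.
\end{aligned}
\]

Combining the above estimates we prove the upper bound.

Now we prove the lower bound. Note that

\[
\begin{aligned}
\|Su_h - S_h u_h\|_{a,\Omega} &\le \|Su_h - Su\|_{a,\Omega} + \|Su - S_h u_h\|_{a,\Omega} \\
&\le C\|u - u_h\|_{0,\Omega} + \|y - y_h\|_{a,\Omega}.
\end{aligned}
\tag{3.18}
\]

Similarly, we can derive that

\[
\begin{aligned}
&\|S^*(S_h u_h - y_d) - S_h^*(S_h u_h - y_d)\|_{a,\Omega} \\
&\qquad \le \|S^*(S_h u_h - y_d) - S^*(Su - y_d)\|_{a,\Omega} + \|S^*(Su - y_d) - S_h^*(S_h u_h - y_d)\|_{a,\Omega} \\
&\qquad \le C\|S_h u_h - Su\|_{0,\Omega} + \|p - p_h\|_{a,\Omega} \\
&\qquad \le C\|y - y_h\|_{a,\Omega} + \|p - p_h\|_{a,\Omega}.
\end{aligned}
\tag{3.19}
\]

Thus, the lower bound follows from the above estimates. This completes the proof. □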


Next, we prove a compact equivalence property which exhibits a certain relationship between the
finite element optimal control approximation and the associated finite element boundary value
approximation.

Theorem 3.3. Let $h \in (0,h_0)$, and let
$(u,y,p) \in U_{ad} \times H_0^1(\Omega) \times H_0^1(\Omega)$ and
$(u_h,y_h,p_h) \in U_{ad} \times V_h \times V_h$ be the solutions of problems (3.1)-(3.2) and
(3.7)-(3.8), respectively. Then the following equivalence properties hold

\[
\|y - y_h\|_{a,\Omega} = \|y^h - y_h\|_{a,\Omega} + O(\kappa(h))\big( \|y - y_h\|_{a,\Omega} + \|p - p_h\|_{a,\Omega} \big),
\tag{3.20}
\]

\[
\|p - p_h\|_{a,\Omega} = \|p^h - p_h\|_{a,\Omega} + O(\kappa(h))\big( \|y - y_h\|_{a,\Omega} + \|p - p_h\|_{a,\Omega} \big)
\tag{3.21}
\]

provided $h_0 \ll 1$.

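Proof. It is obvious that

\[
y - y_h = y^h - y_h + y - y^h, \qquad p - p_h = p^h - p_h + p - p^h.
\tag{3.22}
\]

Moreover, it follows from the stability results for elliptic equations that

\[
\|y - y^h\|_{a,\Omega} \le C\|u - u_h\|_{0,\Omega}, \qquad \|p - p^h\|_{a,\Omega} \le C\|y - y_h\|_{0,\Omega}.
\tag{3.23}
\]

In the following we estimate $\|y - y_h\|_{0,\Omega}$. Let $\psi \in H_0^1(\Omega)$ be the
solution of the following auxiliary problem

\[
\begin{cases}
L^*\psi = y - y_h & \text{in } \Omega, \\
\psi = 0 & \text{on } \partial\Omega.
\end{cases}
\tag{3.24}
\]

Let $\psi_h \in V_h$ be the finite element approximation of $\psi$. Then we can conclude from
(2.7) and the standard duality argument (see, e.g., i) that

\[
\begin{aligned}
\|y - y_h\|_{0,\Omega}^2 &= a(y - y_h, \psi) \\
&= a(y - y_h, \psi - \psi_h) + a(y - y_h, \psi_h) \\
&= a(y - y_h, \psi - \psi_h) + (u - u_h, \psi_h - \psi) + (u - u_h, \psi) \\
&\le C\big( \kappa(h)\|y - y_h\|_{a,\Omega} + \|u - u_h\|_{0,\Omega} \big)\|y - y_h\|_{0,\Omega},
\end{aligned}
\]

which in turn implies

\[
\|y - y_h\|_{0,\Omega} \le C\kappa(h)\|y - y_h\|_{a,\Omega} + C\|u - u_h\|_{0,\Omega}.
\tag{3.25}
\]

Considering (3.23) we have

\[
\|p - p^h\|_{a,\Omega} \le C\kappa(h)\|y - y_h\|_{a,\Omega} + C\|u - u_h\|_{0,\Omega}.
\tag{3.26}
\]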
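It remains to estimate $\|u - u_h\|_{0,\Omega}$. Note that it follows from (3.14) and (3.15) that

\[
\begin{aligned}
\alpha\|u - u_h\|_{0,\Omega}^2
&\le (S_h^*(S_h u_h - y_d) - S^*(Su - y_d), u - u_h) \\
&= (S_h^*(S_h u_h - y_d) - S_h^*(S_h u - y_d), u - u_h) + (S_h^*(S_h u - y_d) - S^*(Su - y_d), u - u_h) \\
&= (S_h(u_h - u), S_h(u - u_h)) + (S_h^*(S_h u - y_d) - S^*(Su - y_d), u - u_h) \\
&\le (S_h^*(S_h u - y_d) - S^*(Su - y_d), u - u_h),
\end{aligned}
\]

which yields

\[
\|u - u_h\|_{0,\Omega} \le C\|S_h^*(S_h u - y_d) - S^*(Su - y_d)\|_{0,\Omega}.
\tag{3.27}
\]

Let $\phi \in H_0^1(\Omega)$ be the solution of the following auxiliary problem

\[
\begin{cases}
L\phi = S_h^*(S_h u - y_d) - S^*(Su - y_d) & \text{in } \Omega, \\
\phi = 0 & \text{on } \partial\Omega.
\end{cases}
\tag{3.28}
\]

Then from the standard duality argument we have

\[
\begin{aligned}
&\|S_h^*(S_h u - y_d) - S^*(Su - y_d)\|_{0,\Omega}^2 = a(\phi, S_h^*(S_h u - y_d) - S^*(Su - y_d)) \\
&\quad = a(\phi - \phi_h, S_h^*(S_h u - y_d) - S^*(Su - y_d)) + a(\phi_h, S_h^*(S_h u - y_d) - S^*(Su - y_d)) \\
&\quad = a(\phi - \phi_h, S_h^*(S_h u - y_d) - S^*(Su - y_d)) + (\phi_h, S_h u - Su) \\
&\quad = a(\phi - \phi_h, S_h^*(S_h u - y_d) - S^*(Su - y_d)) + (\phi_h - \phi, S_h u - Su) + (\phi, S_h u - Su),
\end{aligned}
\tag{3.29}
\]

where $\phi_h \in V_h$ is the finite element approximation of $\phi$. We can conclude from
(2.7)-(2.8) that

\[
\begin{aligned}
&a(\phi - \phi_h, S_h^*(S_h u - y_d) - S^*(Su - y_d)) \\
&\quad \le C\kappa(h)\|S_h^*(S_h u - y_d) - S^*(Su - y_d)\|_{0,\Omega}\,\|S_h^*(S_h u - y_d) - S^*(Su - y_d)\|_{a,\Omega}
\end{aligned}
\tag{3.30}
\]

and

\[
(\phi_h - \phi, S_h u - Su) \le C\kappa^2(h)\|S_h^*(S_h u - y_d) - S^*(Su - y_d)\|_{0,\Omega}\,\|S_h u - Su\|_{0,\Omega},
\tag{3.31}
\]

\[
(\phi, S_h u - Su) \le C\kappa(h)\|S_h^*(S_h u - y_d) - S^*(Su - y_d)\|_{0,\Omega}\,\|S_h u - Su\|_{a,\Omega}.
\tag{3.32}
\]

Then we are able to derive that

\[
\begin{aligned}
&\|S_h^*(S_h u - y_d) - S^*(Su - y_d)\|_{0,\Omega} \\
&\quad \le C\kappa(h)\big( \|S_h^*(S_h u - y_d) - S^*(Su - y_d)\|_{a,\Omega} + \|S_h u - Su\|_{a,\Omega} \big).
\end{aligned}
\tag{3.33}
\]

Combining (3.27) and (3.33) we are led to

\[
\begin{aligned}
\|u - u_h\|_{0,\Omega}
&\lesssim \kappa(h)\big( \|S_h^*(S_h u - y_d) - S^*(Su - y_d)\|_{a,\Omega} + \|S_h u - Su\|_{a,\Omega} \big) \\
&\lesssim \kappa(h)\big( \|p_h - p\|_{a,\Omega} + \|S_h^*(S_h u - y_d) - S_h^*(S_h u_h - y_d)\|_{a,\Omega} + \|S_h u - Su\|_{a,\Omega} \big) \\
&\lesssim \kappa(h)\big( \|p_h - p\|_{a,\Omega} + \|S_h u - S_h u_h\|_{a,\Omega} + \|S_h u - Su\|_{a,\Omega} \big) \\
&\lesssim \kappa(h)\big( \|p_h - p\|_{a,\Omega} + \|S_h u_h - Su\|_{a,\Omega} + \|S_h u - S_h u_h\|_{a,\Omega} \big) \\
&\lesssim \kappa(h)\big( \|p_h - p\|_{a,\Omega} + \|y_h - y\|_{a,\Omega} + \|u - u_h\|_{0,\Omega} \big).
\end{aligned}
\tag{3.34}
\]

If $h_0 \ll 1$ then $\kappa(h) \ll 1$ for all $h \in (0,h_0)$, and we arrive at

\[
\|u - u_h\|_{0,\Omega} \lesssim \kappa(h)\big( \|p_h - p\|_{a,\Omega} + \|y_h - y\|_{a,\Omega} \big).
\tag{3.35}
\]

Inserting the above estimate into (3.23) and (3.26), we conclude from (3.22) the desired results
(3.20)-(3.21). This completes the proof. □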

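We are now in a position to consider the adaptive finite element method for the optimal control
problem (3.1)-(3.2). First we derive a posteriori error estimates for the above optimal control
problem. To begin with, we introduce some notation. Similar to the definitions (2.9) and (2.10),
we define the element residuals $r_{y,T}(y_h)$, $r_{p,T}(p_h)$ and the jump residuals
$j_{y,E}(y_h)$, $j_{p,E}(p_h)$ by

\[
r_{y,T}(y_h) := u_h - Ly_h = u_h + \nabla\cdot(A\nabla y_h) - cy_h \quad \text{in } T \in \mathcal{T}_h,
\tag{3.36}
\]

\[
r_{p,T}(p_h) := y_h - y_d - L^*p_h = y_h - y_d + \nabla\cdot(A^*\nabla p_h) - cp_h \quad \text{in } T \in \mathcal{T}_h,
\tag{3.37}
\]

\[
j_{y,E}(y_h) := [A\nabla y_h]_E \cdot n_E \quad \text{on } E \in \mathcal{E}_h,
\tag{3.38}
\]

\[
j_{p,E}(p_h) := [A^*\nabla p_h]_E \cdot n_E \quad \text{on } E \in \mathcal{E}_h.
\tag{3.39}
\]

For each element $T \in \mathcal{T}_h$, we define the local error indicators $\eta_{y,h}(y_h,T)$
and $\eta_{p,h}(p_h,T)$ by

\[
\eta_{y,h}(y_h,T) := \Big( h_T^2\|r_{y,T}(y_h)\|_{0,T}^2 + \sum_{E \in \mathcal{E}_h,\, E \subset \partial T} h_E\|j_{y,E}(y_h)\|_{0,E}^2 \Big)^{1/2},
\tag{3.40}
\]

\[
\eta_{p,h}(p_h,T) := \Big( h_T^2\|r_{p,T}(p_h)\|_{0,T}^2 + \sum_{E \in \mathcal{E}_h,\, E \subset \partial T} h_E\|j_{p,E}(p_h)\|_{0,E}^2 \Big)^{1/2}.
\tag{3.41}
\]

Then on a subset $\omega \subset \Omega$, we define the error estimators
$\eta_{y,h}(y_h,\omega)$ and $\eta_{p,h}(p_h,\omega)$ by

\[
\eta_{y,h}(y_h,\omega) := \Big( \sum_{T \in \mathcal{T}_h,\, T \subset \omega} \eta_{y,h}^2(y_h,T) \Big)^{1/2},
\tag{3.42}
\]

\[
\eta_{p,h}(p_h,\omega) := \Big( \sum_{T \in \mathcal{T}_h,\, T \subset \omega} \eta_{p,h}^2(p_h,T) \Big)^{1/2}.
\tag{3.43}
\]

Thus, $\eta_{y,h}(y_h,\Omega)$ and $\eta_{p,h}(p_h,\Omega)$ constitute the error estimators for the
state equation and the adjoint state equation on $\Omega$ with respect to $\mathcal{T}_h$.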

Note that $S_h u_h$ and $S_h^*(S_h u_h - y_d)$ are the standard Galerkin projections of $Su_h$ and
$S^*(S_h u_h - y_d)$, respectively. Similar to (2.16)-(2.17), standard a posteriori error
estimates for elliptic boundary value problems give the following upper bounds (see, e.g., [36]),
which show the reliability of the error estimators.

Lemma 3.4. Let $S$ and $S_h$ be the continuous and discrete solution operators defined above. Then
the following a posteriori error estimates hold

\[
\|Su_h - S_h u_h\|_{a,\Omega}^2 \le \tilde C_1\,\eta_{y,h}^2(y_h,\Omega),
\tag{3.44}
\]

\[
\|S^*(S_h u_h - y_d) - S_h^*(S_h u_h - y_d)\|_{a,\Omega}^2 \le \tilde C_1\,\eta_{p,h}^2(p_h,\Omega).
\tag{3.45}
\]

We can also derive the following global a posteriori error lower bounds, i.e., the global
efficiency of the error estimators.

Lemma 3.5. Let $S$ and $S_h$ be the continuous and discrete solution operators defined above. Then
the following a posteriori error lower bounds hold

\[
\tilde C_2\,\eta_{y,h}^2(y_h,\Omega) \le \|Su_h - S_h u_h\|_{a,\Omega}^2 + \tilde C_3\,\mathrm{osc}^2(u_h - Ly_h,\mathcal{T}_h),
\tag{3.46}
\]

\[
\begin{aligned}
\tilde C_2\,\eta_{p,h}^2(p_h,\Omega) &\le \|S^*(S_h u_h - y_d) - S_h^*(S_h u_h - y_d)\|_{a,\Omega}^2 \\
&\quad + \tilde C_3\,\mathrm{osc}^2(y_h - y_d - L^*p_h,\mathcal{T}_h).
\end{aligned}
\tag{3.47}
\]

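Let $h_0 \in (0,1)$ be the mesh size of the initial mesh $\mathcal{T}_{h_0}$ and define

\[
\tilde\kappa(h_0) := \sup_{h \in (0,h_0]} \kappa(h).
\]

It is obvious that $\tilde\kappa(h_0) \ll 1$ if $h_0 \ll 1$. For ease of exposition we also define
the following quantities:

\[
\begin{aligned}
\eta_h^2((y_h,p_h),T) &= \eta_{y,h}^2(y_h,T) + \eta_{p,h}^2(p_h,T), \\
\mathrm{osc}^2((y_h,p_h),T) &= \mathrm{osc}^2(u_h - Ly_h,T) + \mathrm{osc}^2(y_h - y_d - L^*p_h,T),
\end{aligned}
\]

and the straightforward modifications for $\eta_h^2((y_h,p_h),\Omega)$ and
$\mathrm{osc}^2((y_h,p_h),\mathcal{T}_h)$.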

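Now we state the following a posteriori error estimates for the finite element approximation of
the optimal control problem.

Theorem 3.6. Let $h \in (0,h_0)$. Assume that
$(u,y,p) \in U_{ad} \times H_0^1(\Omega) \times H_0^1(\Omega)$ and
$(u_h,y_h,p_h) \in U_{ad} \times V_h \times V_h$ are the solutions of problems (3.1)-(3.2) and
(3.7)-(3.8), respectively. Then there exist positive constants $C_1$, $C_2$ and $C_3$, independent
of the mesh size $h$, such that

\[
\|(y - y_h, p - p_h)\|_a^2 \le C_1\,\eta_h^2((y_h,p_h),\Omega)
\tag{3.48}
\]

and

\[
C_2\,\eta_h^2((y_h,p_h),\Omega) \le \|(y - y_h, p - p_h)\|_a^2 + C_3\,\mathrm{osc}^2((y_h,p_h),\mathcal{T}_h)
\tag{3.49}
\]

provided $h_0 \ll 1$.

Proof. Note that $y^h = Su_h$, $y_h = S_h u_h$, $p^h = S^*(S_h u_h - y_d)$ and
$p_h = S_h^*(S_h u_h - y_d)$. From the estimates (3.20)-(3.21) and Lemmas 3.4 and 3.5 we have

\[
\begin{aligned}
\|(y - y_h, p - p_h)\|_a^2 &\le 2\big( \|y^h - y_h\|_{a,\Omega}^2 + \|p^h - p_h\|_{a,\Omega}^2 \big) + \hat C_1\tilde\kappa^2(h_0)\|(y - y_h, p - p_h)\|_a^2 \\
&\le 2\tilde C_1\,\eta_h^2((y_h,p_h),\Omega) + \hat C_1\tilde\kappa^2(h_0)\|(y - y_h, p - p_h)\|_a^2
\end{aligned}
\]

and

\[
\begin{aligned}
\tilde C_2\,\eta_h^2((y_h,p_h),\Omega) &\le \big( \|y^h - y_h\|_{a,\Omega}^2 + \|p^h - p_h\|_{a,\Omega}^2 \big) + \tilde C_3\,\mathrm{osc}^2((y_h,p_h),\mathcal{T}_h) \\
&\le \|(y - y_h, p - p_h)\|_a^2 + \tilde C_3\,\mathrm{osc}^2((y_h,p_h),\mathcal{T}_h) \\
&\quad + \hat C_2\tilde\kappa^2(h_0)\|(y - y_h, p - p_h)\|_a^2.
\end{aligned}
\]

We obtain the desired results by choosing

\[
C_1 = \frac{2\tilde C_1}{1 - \hat C_1\tilde\kappa^2(h_0)}, \qquad
C_2 = \frac{\tilde C_2}{1 + \hat C_2\tilde\kappa^2(h_0)}, \qquad
C_3 = \frac{\tilde C_3}{1 + \hat C_2\tilde\kappa^2(h_0)}.
\tag{3.50}
\]

□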


The adaptive finite element procedure consists of loops of the form

SOLVE → ESTIMATE → MARK → REFINE.

The ESTIMATE step is based on the a posteriori error estimators presented in Theorem 3.6, while
the REFINE step can be carried out by iterative or recursive bisection of elements with the
minimal refinement condition (see [34, 36]). Due to [7], the procedure REFINE here is not required
to satisfy the interior node property of [32]. Note that there are two error estimators
$\eta_{y,h}(y_h,T)$ and $\eta_{p,h}(p_h,T)$, which are associated with the state approximation and
the adjoint state approximation, respectively. We use the sum of the two estimators as our
indicator for the marking strategy. The marking algorithm based on Dörfler's strategy for optimal
control problems can be described as follows.

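Algorithm 3.7. The Dörfler's marking strategy for OCPs

(1) Given a parameter $0 < \theta < 1$;

(2) Construct a minimal subset $\tilde{\mathcal{T}}_h \subset \mathcal{T}_h$ such that

$\sum_{T\in\tilde{\mathcal{T}}_h} \eta_h^2((y_h,p_h),T) \ge \theta\,\eta_h^2((y_h,p_h),\Omega).$

(3) Mark all the elements in $\tilde{\mathcal{T}}_h$.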
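The marking step of Algorithm 3.7 can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `doerfler_mark`, the dictionary of squared indicators and the toy values are hypothetical and not part of the paper. A set of minimal cardinality is obtained greedily by taking the largest indicators first.

```python
from typing import Dict, Hashable, List


def doerfler_mark(indicators_sq: Dict[Hashable, float], theta: float) -> List[Hashable]:
    """Return a smallest-cardinality set of elements whose squared indicators
    sum to at least theta times the total of all squared indicators."""
    if not 0.0 < theta < 1.0:
        raise ValueError("theta must lie in (0, 1)")
    total = sum(indicators_sq.values())
    # Greedy choice: taking the largest indicators first gives a set of
    # minimal cardinality that reaches the threshold theta * total.
    ranked = sorted(indicators_sq.items(), key=lambda kv: kv[1], reverse=True)
    marked, acc = [], 0.0
    for element, eta_sq in ranked:
        if acc >= theta * total:
            break
        marked.append(element)
        acc += eta_sq
    return marked


# Toy data (hypothetical): squared indicators eta_h^2((y_h, p_h), T) on four elements.
eta_sq = {"T0": 0.5, "T1": 0.25, "T2": 0.125, "T3": 0.125}
print(doerfler_mark(eta_sq, theta=0.5))
print(doerfler_mark(eta_sq, theta=0.7))
```

A larger value of the parameter marks more elements per loop, which reduces the number of adaptive iterations at the price of less selective refinement.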

Then we can present the adaptive finite element algorithm for the optimal control problem
(3.1)-(3.2) as follows:

Algorithm 3.8. Adaptive finite element algorithm for OCPs:

(1) Given an initial mesh $\mathcal{T}_{h_0}$ with mesh size $h_0$ and construct the finite element space $V_{h_0}$.

(2) Set $k = 0$ and solve the discrete optimal control problem to obtain
$(u_{h_k},y_{h_k},p_{h_k}) \in U_{ad} \times V_{h_k} \times V_{h_k}$.

(3) Compute the local error indicator $\eta_{h_k}((y_{h_k},p_{h_k}),T)$.

(4) Construct $\tilde{\mathcal{T}}_{h_k} \subset \mathcal{T}_{h_k}$ by the marking Algorithm 3.7.

(5) Refine $\mathcal{T}_{h_k}$ to get a new conforming mesh $\mathcal{T}_{h_{k+1}}$ by procedure REFINE.

(6) Construct the finite element space $V_{h_{k+1}}$ and solve the discrete optimal control problem
to obtain $(u_{h_{k+1}},y_{h_{k+1}},p_{h_{k+1}}) \in U_{ad} \times V_{h_{k+1}} \times V_{h_{k+1}}$.

(7) Set $k = k + 1$ and go to Step (3).
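The control flow of Algorithm 3.8 can be sketched as a generic driver. The following Python sketch is an illustration only: the callables `solve`, `estimate` and `refine`, the stopping tolerance `tol` and the one-dimensional toy stand-ins are assumptions made for the example and do not implement a finite element solver for the optimal control problem.

```python
from typing import Callable, List, Sequence


def mark(eta_sq: Sequence[float], theta: float) -> List[int]:
    """Doerfler marking: indices of a minimal set carrying a theta-fraction."""
    order = sorted(range(len(eta_sq)), key=lambda i: eta_sq[i], reverse=True)
    target, acc, chosen = theta * sum(eta_sq), 0.0, []
    for i in order:
        if acc >= target:
            break
        chosen.append(i)
        acc += eta_sq[i]
    return chosen


def afem(mesh, solve: Callable, estimate: Callable, refine: Callable,
         theta: float, max_iter: int, tol: float):
    """SOLVE -> ESTIMATE -> MARK -> REFINE until the estimator drops below tol."""
    history = []
    for _ in range(max_iter):
        solution = solve(mesh)                      # SOLVE
        eta_sq = estimate(mesh, solution)           # ESTIMATE (squared, per element)
        history.append((len(mesh), sum(eta_sq) ** 0.5))
        if history[-1][1] <= tol:
            break
        mesh = refine(mesh, mark(eta_sq, theta))    # MARK + REFINE
    return mesh, history


# Toy stand-ins (hypothetical): the "mesh" is a list of 1D cells (left, right)
# and the indicator mimics a singularity located at x = 0.
def toy_solve(mesh):
    return None


def toy_estimate(mesh, _solution):
    return [(b - a) ** 2 / (a + b) for a, b in mesh]


def toy_refine(mesh, marked):
    marked = set(marked)
    new_mesh = []
    for i, (a, b) in enumerate(mesh):
        if i in marked:
            m = 0.5 * (a + b)
            new_mesh += [(a, m), (m, b)]            # bisect marked cells
        else:
            new_mesh.append((a, b))
    return new_mesh


mesh, history = afem([(0.0, 0.5), (0.5, 1.0)], toy_solve, toy_estimate,
                     toy_refine, theta=0.5, max_iter=20, tol=0.05)
for n_cells, eta in history:
    print(n_cells, round(eta, 4))
```

In an actual computation `solve` would return the discrete triple of control, state and adjoint state, and `estimate` would return the summed squared indicators of the state and adjoint state on each element.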


4. Convergence of AFEM for optimal control problem 


In this section we intend to prove the convergence of the adaptive Algorithm 3.8. The proof
uses some ideas of [iiiiii and some results of [7]. Following Theorem 13.31 we may firstly establish 
some relationships between the two level approximations, which will be used in our analysis for 
both convergence and optimal complexity. 


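Theorem 4.1. Let $h,H \in (0,h_0)$ and $(u,y,p) \in U_{ad} \times H_0^1(\Omega) \times H_0^1(\Omega)$ be the solution of problem
(3.1)-(3.2). Assume that $(u_h,y_h,p_h) \in U_{ad} \times V_h \times V_h$ and $(u_H,y_H,p_H) \in U_{ad} \times V_H \times V_H$ are the
solutions of the discrete problem on $\mathcal{T}_h$ and $\mathcal{T}_H$, respectively. Define $y^H := Su_H$ and
$p^H := S^*(S_Hu_H - y_d)$. Then the following properties hold

(4.1) $\|y-y_h\|_{a,\Omega} = \|y^H-\mathcal{R}_hy^H\|_{a,\Omega} + O(\kappa(h_0))\big(\|y-y_h\|_{a,\Omega} + \|y-y_H\|_{a,\Omega} + \|p-p_h\|_{a,\Omega} + \|p-p_H\|_{a,\Omega}\big),$

(4.2) $\|p-p_h\|_{a,\Omega} = \|p^H-\mathcal{R}_hp^H\|_{a,\Omega} + O(\kappa(h_0))\big(\|y-y_h\|_{a,\Omega} + \|y-y_H\|_{a,\Omega} + \|p-p_h\|_{a,\Omega} + \|p-p_H\|_{a,\Omega}\big),$

(4.3) $\mathrm{osc}(u_h-Ly_h,\mathcal{T}_h) = \mathrm{osc}(u_H-L\mathcal{R}_hy^H,\mathcal{T}_h) + O(\kappa(h_0))\big(\|y-y_h\|_{a,\Omega} + \|p-p_h\|_{a,\Omega} + \|y-y_H\|_{a,\Omega} + \|p-p_H\|_{a,\Omega}\big),$

(4.4) $\mathrm{osc}(y_h-y_d-L^*p_h,\mathcal{T}_h) = \mathrm{osc}(y_H-y_d-L^*\mathcal{R}_hp^H,\mathcal{T}_h) + O(\kappa(h_0))\big(\|y-y_h\|_{a,\Omega} + \|p-p_h\|_{a,\Omega} + \|y-y_H\|_{a,\Omega} + \|p-p_H\|_{a,\Omega}\big)$

and

(4.5) $\eta_{y,h}(y_h,\Omega) = \tilde\eta_h(\mathcal{R}_hy^H,\Omega) + O(\kappa(h_0))\big(\|y-y_h\|_{a,\Omega} + \|y-y_H\|_{a,\Omega} + \|p-p_h\|_{a,\Omega} + \|p-p_H\|_{a,\Omega}\big),$

(4.6) $\eta_{p,h}(p_h,\Omega) = \tilde\eta_h(\mathcal{R}_hp^H,\Omega) + O(\kappa(h_0))\big(\|y-y_h\|_{a,\Omega} + \|y-y_H\|_{a,\Omega} + \|p-p_h\|_{a,\Omega} + \|p-p_H\|_{a,\Omega}\big)$

provided $h_0 \ll 1$.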


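Proof. Note that

(4.7) $y - y_h = y^H - \mathcal{R}_hy^H + \mathcal{R}_h(y^H-y^h) + y - y^H$

and

(4.8) $p - p_h = p^H - \mathcal{R}_hp^H + \mathcal{R}_h(p^H-p^h) + p - p^H.$

On the other hand, it follows from (2.4) that

$\|\mathcal{R}_h(y^H-y^h) + y - y^H\|_{a,\Omega} \lesssim \|y^H-y^h\|_{a,\Omega} + \|y-y^H\|_{a,\Omega}$
$\qquad \lesssim \|y-y^h\|_{a,\Omega} + \|y-y^H\|_{a,\Omega}$
(4.9) $\qquad \lesssim \|u-u_h\|_{0,\Omega} + \|u-u_H\|_{0,\Omega}$

and

$\|\mathcal{R}_h(p^H-p^h) + p - p^H\|_{a,\Omega} \lesssim \|p^H-p^h\|_{a,\Omega} + \|p-p^H\|_{a,\Omega}$
$\qquad \lesssim \|y-y_h\|_{0,\Omega} + \|y-y_H\|_{0,\Omega}$
$\qquad \lesssim \|u-u_h\|_{0,\Omega} + \kappa(h)\|y-y_h\|_{a,\Omega}$
(4.10) $\qquad\quad + \|u-u_H\|_{0,\Omega} + \kappa(H)\|y-y_H\|_{a,\Omega},$

where in the last inequality we used (3.25). It follows from (3.35) that

$\|\mathcal{R}_h(y^H-y^h) + y - y^H\|_{a,\Omega} + \|\mathcal{R}_h(p^H-p^h) + p - p^H\|_{a,\Omega}$
$\qquad \lesssim \kappa(h)\big(\|y-y_h\|_{a,\Omega} + \|p-p_h\|_{a,\Omega}\big) + \kappa(H)\big(\|y-y_H\|_{a,\Omega} + \|p-p_H\|_{a,\Omega}\big)$
(4.11) $\qquad \lesssim \kappa(h_0)\big(\|y-y_h\|_{a,\Omega} + \|p-p_h\|_{a,\Omega} + \|y-y_H\|_{a,\Omega} + \|p-p_H\|_{a,\Omega}\big)$

provided $h_0 \ll 1$. Combining this with (4.7)-(4.8) yields (4.1) and (4.2).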

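Then we prove (4.3)-(4.4). Note that

(4.12) $u_h - Ly_h = u_H - L\mathcal{R}_hy^H + L\mathcal{R}_h(y^H-y^h) + (u_h-u_H),$

(4.13) $y_h - y_d - L^*p_h = y_H - y_d - L^*\mathcal{R}_hp^H + L^*\mathcal{R}_h(p^H-p^h) + (y_h-y_H).$

From Lemma 2.2 we have

$\mathrm{osc}(L\mathcal{R}_h(y^H-y^h),\mathcal{T}_h) \lesssim \|\mathcal{R}_h(y^H-y^h)\|_{a,\Omega},$
$\mathrm{osc}(L^*\mathcal{R}_h(p^H-p^h),\mathcal{T}_h) \lesssim \|\mathcal{R}_h(p^H-p^h)\|_{a,\Omega},$

which together with (4.11) imply

(4.14) $\mathrm{osc}(L\mathcal{R}_h(y^H-y^h),\mathcal{T}_h) + \mathrm{osc}(L^*\mathcal{R}_h(p^H-p^h),\mathcal{T}_h) \lesssim \kappa(h_0)\big(\|y-y_h\|_{a,\Omega} + \|p-p_h\|_{a,\Omega} + \|y-y_H\|_{a,\Omega} + \|p-p_H\|_{a,\Omega}\big).$

Moreover, since $f_T$ is the $L^2$-projection of $f$ onto piecewise polynomials on $T$, there holds

$\mathrm{osc}(f,\mathcal{T}_h) = \Big(\sum_{T\in\mathcal{T}_h} \|h_T(f-f_T)\|_{0,T}^2\Big)^{1/2} \lesssim \|f\|_{0,\Omega}.$

In view of (3.25) we thus have

$\mathrm{osc}(u_h-u_H,\mathcal{T}_h) \lesssim \|u_h-u_H\|_{0,\Omega} \lesssim \|u-u_h\|_{0,\Omega} + \|u-u_H\|_{0,\Omega},$
$\mathrm{osc}(y_h-y_H,\mathcal{T}_h) \lesssim \|y_h-y_H\|_{0,\Omega} \lesssim \|u-u_h\|_{0,\Omega} + \|u-u_H\|_{0,\Omega} + \kappa(H)\|y-y_H\|_{a,\Omega} + \kappa(h)\|y-y_h\|_{a,\Omega},$

which together with (3.35) yield

(4.15) $\mathrm{osc}(u_h-u_H,\mathcal{T}_h) \lesssim \kappa(h_0)\big(\|y-y_h\|_{a,\Omega} + \|p-p_h\|_{a,\Omega} + \|y-y_H\|_{a,\Omega} + \|p-p_H\|_{a,\Omega}\big),$

(4.16) $\mathrm{osc}(y_h-y_H,\mathcal{T}_h) \lesssim \kappa(h_0)\big(\|y-y_h\|_{a,\Omega} + \|p-p_h\|_{a,\Omega} + \|y-y_H\|_{a,\Omega} + \|p-p_H\|_{a,\Omega}\big).$

We can conclude the desired results (4.3)-(4.4) from the definition of the data oscillation and
(4.12)-(4.16).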

Now it remains to prove (4.5) and (4.6). From the definition of $y^h$ and $y^H$ we know that $y^h - y^H$
is the solution of the elliptic boundary value problem with right hand side $u_h - u_H$. It follows from
(2.17) and (4.9) that

$\tilde\eta_h(\mathcal{R}_h(y^h-y^H),\Omega) \lesssim \|(y^h-y^H) - \mathcal{R}_h(y^h-y^H)\|_{a,\Omega} + \mathrm{osc}(u_h-u_H-L\mathcal{R}_h(y^h-y^H),\mathcal{T}_h)$
(4.17) $\qquad \lesssim \|u-u_h\|_{0,\Omega} + \|u-u_H\|_{0,\Omega} + \mathrm{osc}(u_h-u_H-L\mathcal{R}_h(y^h-y^H),\mathcal{T}_h).$

From (2.14), (3.35), (4.14) and (4.15) we are led to

(4.18) $\mathrm{osc}(u_h-u_H-L\mathcal{R}_h(y^h-y^H),\mathcal{T}_h) \lesssim \kappa(h_0)\big(\|y-y_h\|_{a,\Omega} + \|p-p_h\|_{a,\Omega} + \|y-y_H\|_{a,\Omega} + \|p-p_H\|_{a,\Omega}\big).$

Note that

$\eta_{y,h}(y_h,\Omega) = \tilde\eta_h(\mathcal{R}_hy^h,\Omega) = \tilde\eta_h(\mathcal{R}_hy^H + \mathcal{R}_h(y^h-y^H),\Omega).$

Combining this with (4.17) and (4.18) gives

$\eta_{y,h}(y_h,\Omega) = \tilde\eta_h(\mathcal{R}_hy^H,\Omega) + O(\kappa(h_0))\big(\|y-y_h\|_{a,\Omega} + \|y-y_H\|_{a,\Omega} + \|p-p_h\|_{a,\Omega} + \|p-p_H\|_{a,\Omega}\big),$

which proves (4.5). Similarly we can prove (4.6). Thus, we complete the proof of the theorem. □

Now we are ready to prove the error reduction for the sum of the energy errors and the scaled 
error estimators of the state y and the adjoint state p, between two consecutive adaptive loops. 

Theorem 4.2. Let $(u,y,p) \in U_{ad} \times H_0^1(\Omega) \times H_0^1(\Omega)$ be the solution of problem (3.1)-(3.2) and
$(u_{h_k},y_{h_k},p_{h_k}) \in U_{ad} \times V_{h_k} \times V_{h_k}$ be a sequence of solutions to the discrete problem produced by
Algorithm 3.8. Then there exist constants $\gamma > 0$ and $\beta \in (0,1)$ depending only on the shape
regularity of meshes and the parameter $\theta$ used by Algorithm 3.7, such that for any two consecutive
iterates $k$ and $k+1$, we have

(4.19) $\|(y-y_{h_{k+1}},p-p_{h_{k+1}})\|_a^2 + \gamma\,\eta_{h_{k+1}}^2((y_{h_{k+1}},p_{h_{k+1}}),\Omega) \le \beta^2\big(\|(y-y_{h_k},p-p_{h_k})\|_a^2 + \gamma\,\eta_{h_k}^2((y_{h_k},p_{h_k}),\Omega)\big)$

provided $h_0 \ll 1$. Therefore, Algorithm 3.8 converges with a linear rate $\beta$, namely, the $k$-th iterate
solution $(u_{h_k},y_{h_k},p_{h_k})$ of Algorithm 3.8 satisfies

(4.20) $\|(y-y_{h_k},p-p_{h_k})\|_a^2 + \gamma\,\eta_{h_k}^2((y_{h_k},p_{h_k}),\Omega) \le C_0\beta^{2k},$

where $C_0 = \|(y-y_{h_0},p-p_{h_0})\|_a^2 + \gamma\,\eta_{h_0}^2((y_{h_0},p_{h_0}),\Omega)$.

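Proof. For convenience, we use $(u_H,y_H,p_H)$ and $(u_h,y_h,p_h)$ to denote $(u_{h_k},y_{h_k},p_{h_k})$ and
$(u_{h_{k+1}},y_{h_{k+1}},p_{h_{k+1}})$, respectively. So it suffices to prove that for $(u_H,y_H,p_H)$ and $(u_h,y_h,p_h)$ there holds

(4.21) $\|(y-y_h,p-p_h)\|_a^2 + \gamma\,\eta_h^2((y_h,p_h),\Omega) \le \beta^2\big(\|(y-y_H,p-p_H)\|_a^2 + \gamma\,\eta_H^2((y_H,p_H),\Omega)\big).$

Recall that $y^h := Su_h$, $y^H := Su_H$ and $p^h := S^*(S_hu_h - y_d)$, $p^H := S^*(S_Hu_H - y_d)$. It follows
from Algorithm 3.7 that the Dörfler's marking strategy in Algorithm 2.3 is satisfied for $(y^H,p^H)$.
So we conclude from Theorem 2.6 that there exist constants $\tilde\gamma > 0$ and $\tilde\beta \in (0,1)$ satisfying

(4.22) $\|(y^H-\mathcal{R}_hy^H,p^H-\mathcal{R}_hp^H)\|_a^2 + \tilde\gamma\big(\tilde\eta_h^2(\mathcal{R}_hy^H,\Omega) + \tilde\eta_h^2(\mathcal{R}_hp^H,\Omega)\big)$
$\qquad \le \tilde\beta^2\Big(\|(y^H-\mathcal{R}_Hy^H,p^H-\mathcal{R}_Hp^H)\|_a^2 + \tilde\gamma\big(\tilde\eta_H^2(\mathcal{R}_Hy^H,\Omega) + \tilde\eta_H^2(\mathcal{R}_Hp^H,\Omega)\big)\Big).$

Note that $\mathcal{R}_Hy^H = y_H$ and $\mathcal{R}_Hp^H = p_H$, we thus have

(4.23) $\|(y^H-\mathcal{R}_hy^H,p^H-\mathcal{R}_hp^H)\|_a^2 + \tilde\gamma\big(\tilde\eta_h^2(\mathcal{R}_hy^H,\Omega) + \tilde\eta_h^2(\mathcal{R}_hp^H,\Omega)\big)$
$\qquad \le \tilde\beta^2\Big(\|(y^H-y_H,p^H-p_H)\|_a^2 + \tilde\gamma\big(\eta_{y,H}^2(y_H,\Omega) + \eta_{p,H}^2(p_H,\Omega)\big)\Big).$

We conclude from (4.1)-(4.2) and (4.5)-(4.6) that there exists a constant $\hat C_4 > 0$ such that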

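$\|(y-y_h,p-p_h)\|_a^2 + \tilde\gamma\,\eta_h^2((y_h,p_h),\Omega)$
$\qquad \le (1+\delta_1)\|(y^H-\mathcal{R}_hy^H,p^H-\mathcal{R}_hp^H)\|_a^2 + (1+\delta_1)\tilde\gamma\big(\tilde\eta_h^2(\mathcal{R}_hy^H,\Omega) + \tilde\eta_h^2(\mathcal{R}_hp^H,\Omega)\big)$
$\qquad\quad + \hat C_4(1+\delta_1^{-1})\kappa^2(h_0)\big(\|(y-y_h,p-p_h)\|_a^2 + \|(y-y_H,p-p_H)\|_a^2\big)$
$\qquad\quad + \hat C_4(1+\delta_1^{-1})\kappa^2(h_0)\tilde\gamma\big(\|(y-y_h,p-p_h)\|_a^2 + \|(y-y_H,p-p_H)\|_a^2\big),$

where the $\delta_1$-Young inequality is used and $\delta_1 \in (0,1)$ satisfies

(4.24) $(1+\delta_1)\tilde\beta^2 < 1.$

Thus, there exists a positive constant $\hat C_5$ depending on $\hat C_4$ and $\tilde\gamma$ such that

(4.25) $\|(y-y_h,p-p_h)\|_a^2 + \tilde\gamma\,\eta_h^2((y_h,p_h),\Omega)$
$\qquad \le (1+\delta_1)\Big(\|(y^H-\mathcal{R}_hy^H,p^H-\mathcal{R}_hp^H)\|_a^2 + \tilde\gamma\big(\tilde\eta_h^2(\mathcal{R}_hy^H,\Omega) + \tilde\eta_h^2(\mathcal{R}_hp^H,\Omega)\big)\Big)$
$\qquad\quad + \hat C_5\delta_1^{-1}\kappa^2(h_0)\big(\|(y-y_h,p-p_h)\|_a^2 + \|(y-y_H,p-p_H)\|_a^2\big).$

It follows from (4.23) and (4.25) that

(4.26) $\|(y-y_h,p-p_h)\|_a^2 + \tilde\gamma\,\eta_h^2((y_h,p_h),\Omega)$
$\qquad \le (1+\delta_1)\tilde\beta^2\big(\|(y^H-y_H,p^H-p_H)\|_a^2 + \tilde\gamma\,\eta_H^2((y_H,p_H),\Omega)\big)$
$\qquad\quad + \hat C_5\delta_1^{-1}\kappa^2(h_0)\big(\|(y-y_h,p-p_h)\|_a^2 + \|(y-y_H,p-p_H)\|_a^2\big).$

Then using Theorem 3.3 we arrive at

$\|(y-y_h,p-p_h)\|_a^2 + \tilde\gamma\,\eta_h^2((y_h,p_h),\Omega)$
$\qquad \le (1+\delta_1)\tilde\beta^2\big((1+\hat C_6\kappa(h_0))\|(y-y_H,p-p_H)\|_a^2 + \tilde\gamma\,\eta_H^2((y_H,p_H),\Omega)\big)$
$\qquad\quad + \hat C_5\delta_1^{-1}\kappa^2(h_0)\big(\|(y-y_h,p-p_h)\|_a^2 + \|(y-y_H,p-p_H)\|_a^2\big),$

and thus

(4.27) $\|(y-y_h,p-p_h)\|_a^2 + \tilde\gamma\,\eta_h^2((y_h,p_h),\Omega)$
$\qquad \le (1+\delta_1)\tilde\beta^2\big(\|(y-y_H,p-p_H)\|_a^2 + \tilde\gamma\,\eta_H^2((y_H,p_H),\Omega)\big)$
$\qquad\quad + C_4\kappa(h_0)\|(y-y_H,p-p_H)\|_a^2 + C_4\delta_1^{-1}\kappa^2(h_0)\|(y-y_h,p-p_h)\|_a^2,$

where $C_4$ is a positive constant depending on $\hat C_5$ and $\hat C_6$ when $h_0 \ll 1$. So we can derive

(4.28) $\big(1-C_4\delta_1^{-1}\kappa^2(h_0)\big)\|(y-y_h,p-p_h)\|_a^2 + \tilde\gamma\,\eta_h^2((y_h,p_h),\Omega)$
$\qquad \le \big((1+\delta_1)\tilde\beta^2 + C_4\kappa(h_0)\big)\|(y-y_H,p-p_H)\|_a^2 + (1+\delta_1)\tilde\beta^2\tilde\gamma\,\eta_H^2((y_H,p_H),\Omega),$

or equivalently,

(4.29) $\|(y-y_h,p-p_h)\|_a^2 + \dfrac{\tilde\gamma}{1-C_4\delta_1^{-1}\kappa^2(h_0)}\,\eta_h^2((y_h,p_h),\Omega)$
$\qquad \le \dfrac{(1+\delta_1)\tilde\beta^2 + C_4\kappa(h_0)}{1-C_4\delta_1^{-1}\kappa^2(h_0)}\,\|(y-y_H,p-p_H)\|_a^2 + \dfrac{(1+\delta_1)\tilde\beta^2\tilde\gamma}{1-C_4\delta_1^{-1}\kappa^2(h_0)}\,\eta_H^2((y_H,p_H),\Omega).$

Since $\kappa(h_0) \ll 1$ provided that $h_0 \ll 1$, we can define the constant $\beta$ as

(4.30) $\beta := \Big(\dfrac{(1+\delta_1)\tilde\beta^2 + C_4\kappa(h_0)}{1-C_4\delta_1^{-1}\kappa^2(h_0)}\Big)^{1/2},$

which satisfies $\beta \in (0,1)$ if $h_0 \ll 1$. Then

(4.31) $\|(y-y_h,p-p_h)\|_a^2 + \dfrac{\tilde\gamma}{1-C_4\delta_1^{-1}\kappa^2(h_0)}\,\eta_h^2((y_h,p_h),\Omega)$
$\qquad \le \beta^2\Big(\|(y-y_H,p-p_H)\|_a^2 + \dfrac{(1+\delta_1)\tilde\beta^2\tilde\gamma}{(1+\delta_1)\tilde\beta^2 + C_4\kappa(h_0)}\,\eta_H^2((y_H,p_H),\Omega)\Big).$

Now we choose

(4.32) $\gamma := \dfrac{\tilde\gamma}{1-C_4\delta_1^{-1}\kappa^2(h_0)};$

it is obvious that

$\dfrac{(1+\delta_1)\tilde\beta^2\tilde\gamma}{(1+\delta_1)\tilde\beta^2 + C_4\kappa(h_0)} = \dfrac{(1+\delta_1)\tilde\beta^2\big(1-C_4\delta_1^{-1}\kappa^2(h_0)\big)\gamma}{(1+\delta_1)\tilde\beta^2 + C_4\kappa(h_0)} < \big(1-C_4\delta_1^{-1}\kappa^2(h_0)\big)\gamma < \gamma.$

Then we obtain (4.21), which completes the proof. □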
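To see why the smallness of the initial mesh size enters, the constants chosen at the end of the proof can be evaluated numerically. The sketch below is only illustrative: it assumes the contraction factor has the form of the square root of ((1 + delta_1) * beta_bvp^2 + C_4 * kappa) / (1 - C_4 * kappa^2 / delta_1), where `beta_bvp` and `gamma_bvp` stand for the contraction factor and scaling constant of the boundary value problem from Theorem 2.6, and every numerical value is hypothetical.

```python
import math


def contraction_constants(beta_bvp, gamma_bvp, delta1, c4, kappa):
    """Evaluate the contraction factor beta and the scaling gamma of the
    optimal control problem from the boundary value problem constants."""
    denom = 1.0 - c4 * kappa ** 2 / delta1
    if denom <= 0.0:
        raise ValueError("kappa(h0) is too large for the perturbation argument")
    beta = math.sqrt(((1.0 + delta1) * beta_bvp ** 2 + c4 * kappa) / denom)
    gamma = gamma_bvp / denom
    return beta, gamma


# Hypothetical values: beta_bvp = 0.8, gamma_bvp = 1, delta1 = 0.2, C4 = 1,
# so that (1 + delta1) * beta_bvp**2 = 0.768 < 1 as required by (4.24).
for kappa in (0.2, 0.05, 0.01):
    beta, gamma = contraction_constants(0.8, 1.0, 0.2, 1.0, kappa)
    status = "contraction" if beta < 1.0 else "no contraction"
    print(f"kappa(h0) = {kappa}: beta = {beta:.4f}, gamma = {gamma:.4f} ({status})")
```

With these sample values the factor exceeds one for the largest perturbation and drops below one once the perturbation is small, which mirrors the requirement that the initial mesh be sufficiently fine.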


Remark 4.3. We remark that the requirement $h_0 \ll 1$ on the initial mesh $\mathcal{T}_{h_0}$ is not restrictive
for the convergence analysis of AFEM for nonlinear problems, such as optimal control problems 
studied in this paper, see, e.g., [M]. For similar requirement we refer to [iniin] for the convergence 
analysis of adaptive finite element eigenvalue computations and [31] for the adaptive finite element
computations for nonsymmetric boundary value problems, we should also mention [S] for the 
adaptive finite element method of a semilinear elliptic equation. 

Remark 4.4. In the adaptive Algorithm 3.8 we use the sum of the error estimators $\eta_{y,h}(y_h,T)$
contributed by the state approximation and $\eta_{p,h}(p_h,T)$ contributed by the adjoint state approximation
as indicator to select the subset $\tilde{\mathcal{T}}_h$ for refinement. This marking strategy enables us to prove the
convergence and quasi-optimality (see Section 5) of AFEM for optimal control problems. We remark
that it is also possible to use the separate marking for the contributions of $\eta_{y,h}(y_h,T)$ and
$\eta_{p,h}(p_h,T)$ as follows:

• Construct a minimal subset $\tilde{\mathcal{T}}_{h,1} \subset \mathcal{T}_h$ such that $\sum_{T\in\tilde{\mathcal{T}}_{h,1}} \eta_{y,h}^2(y_h,T) \ge \theta\,\eta_{y,h}^2(y_h,\Omega)$.

• Construct another minimal subset $\tilde{\mathcal{T}}_{h,2} \subset \mathcal{T}_h$ such that $\sum_{T\in\tilde{\mathcal{T}}_{h,2}} \eta_{p,h}^2(p_h,T) \ge \theta\,\eta_{p,h}^2(p_h,\Omega)$.

• Set $\tilde{\mathcal{T}}_h := \tilde{\mathcal{T}}_{h,1} \cup \tilde{\mathcal{T}}_{h,2}$ and mark all the elements in $\tilde{\mathcal{T}}_h$.

With this marking strategy we can also prove the convergence of AFEM for optimal control problems
by using the results of [7, 11] for a single boundary value problem. To be more specific, the error
reduction (4.22) can be derived separately for the state and adjoint state approximations. However,
the resulting over-refinement for this marking strategy prevents us from proving the quasi-optimality of
the adaptive algorithm.
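The over-refinement mentioned in Remark 4.4 can be observed on a small example. The Python sketch below compares marking by the summed indicators with the separate marking described above; the function names and the squared indicator values are hypothetical and chosen only for illustration. The union of the two separately marked sets always satisfies the Dörfler property for the summed indicators, so it can never contain fewer elements than the minimal set for the sum.

```python
from typing import List, Sequence


def doerfler(eta_sq: Sequence[float], theta: float) -> List[int]:
    """Indices of a minimal set whose squared indicators reach theta * total."""
    order = sorted(range(len(eta_sq)), key=lambda i: eta_sq[i], reverse=True)
    target, acc, chosen = theta * sum(eta_sq), 0.0, []
    for i in order:
        if acc >= target:
            break
        chosen.append(i)
        acc += eta_sq[i]
    return chosen


def mark_summed(eta_y_sq, eta_p_sq, theta):
    """Marking by the sum of state and adjoint indicators (as in Algorithm 3.7)."""
    return sorted(doerfler([a + b for a, b in zip(eta_y_sq, eta_p_sq)], theta))


def mark_separate(eta_y_sq, eta_p_sq, theta):
    """Separate marking for the state and the adjoint state, then the union."""
    return sorted(set(doerfler(eta_y_sq, theta)) | set(doerfler(eta_p_sq, theta)))


# Hypothetical squared indicators on four elements: the state error dominates
# and is concentrated on element 0, the adjoint error is small and spread out.
eta_y_sq = [4.0, 1.0, 0.5, 0.5]
eta_p_sq = [0.01, 0.01, 0.04, 0.04]
print("summed  :", mark_summed(eta_y_sq, eta_p_sq, theta=0.5))
print("separate:", mark_separate(eta_y_sq, eta_p_sq, theta=0.5))
```

In this toy configuration the separate strategy also refines elements whose contribution to the total error is negligible, which is the effect that obstructs a quasi-optimality proof.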


5. Complexity of AFEM for optimal control problem 


In this section we intend to analyse the complexity of adaptive finite element algorithm for 
optimal control problems based on the known results of the complexity for elliptic boundary value 
problems. The proof uses some ideas of [HllTl] and some results of [7]. 

Similar to [7| and m, for our purpose to analyse the complexity of AFEM for optimal control 
problems we need to introduce a function approximation class as follows 

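$\mathcal{A}_\gamma^s := \{(y,p,y_d) \in H_0^1(\Omega) \times H_0^1(\Omega) \times L^2(\Omega) : |(y,p,y_d)|_{s,\gamma} < +\infty\},$

where $\gamma > 0$ is some constant and

$|(y,p,y_d)|_{s,\gamma} = \sup_{\varepsilon>0}\ \varepsilon \inf_{\{\mathcal{T}\subset\mathcal{T}_{h_0}\,:\ \inf(\|(y-y_{\mathcal{T}},p-p_{\mathcal{T}})\|_a^2 + (\gamma+1)\mathrm{osc}^2((y_{\mathcal{T}},p_{\mathcal{T}}),\mathcal{T}))^{1/2}\le\varepsilon\}} (\#\mathcal{T}-\#\mathcal{T}_{h_0})^s.$

Here $\mathcal{T} \subset \mathcal{T}_{h_0}$ means that $\mathcal{T}$ is a refinement of $\mathcal{T}_{h_0}$, and $y_{\mathcal{T}}$ and $p_{\mathcal{T}}$ are elements of the finite element space
corresponding to the partition $\mathcal{T}$. It is seen from the definition that $\mathcal{A}_\gamma^s = \mathcal{A}_1^s$ for all $\gamma > 0$, thus
we use $\mathcal{A}^s$ throughout the paper with corresponding norm $|\cdot|_s$. So $\mathcal{A}^s$ is the class of functions that
can be approximated with a given tolerance $\varepsilon$ by continuous piecewise linear polynomial functions
over a partition $\mathcal{T}$ with number of degrees of freedom $\#\mathcal{T}-\#\mathcal{T}_{h_0} \lesssim \varepsilon^{-1/s}|(y,p,y_d)|_s^{1/s}$.

Now we are in the position to prepare for the proof of optimal complexity of Algorithm 3.8 for the
optimal control problem (3.1)-(3.2). At first, we define $y^{h_k} := Su_{h_k}$ and $p^{h_k} := S^*(S_{h_k}u_{h_k} - y_d)$.
Then we have the following result.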

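Lemma 5.1. Let $(u_{h_k},y_{h_k},p_{h_k}) \in U_{ad} \times V_{h_k} \times V_{h_k}$ and $(u_{h_{k+1}},y_{h_{k+1}},p_{h_{k+1}}) \in U_{ad} \times V_{h_{k+1}} \times V_{h_{k+1}}$
be discrete solutions of the discrete optimal control problem over the mesh $\mathcal{T}_{h_k}$ and its refinement $\mathcal{T}_{h_{k+1}}$ with marked
elements $\mathcal{M}_{h_k}$. Suppose they satisfy the following property

(5.1) $\|(y-y_{h_{k+1}},p-p_{h_{k+1}})\|_a^2 + \gamma_*\mathrm{osc}^2((y_{h_{k+1}},p_{h_{k+1}}),\mathcal{T}_{h_{k+1}}) \le \beta_*^2\big(\|(y-y_{h_k},p-p_{h_k})\|_a^2 + \gamma_*\mathrm{osc}^2((y_{h_k},p_{h_k}),\mathcal{T}_{h_k})\big)$

with $\gamma_*$ and $\beta_*$ some positive constants. Then for the associated state and adjoint state
approximations we have

(5.2) $\|(y^{h_k}-\mathcal{R}_{h_{k+1}}y^{h_k},p^{h_k}-\mathcal{R}_{h_{k+1}}p^{h_k})\|_a^2 + \tilde\gamma_*\mathrm{osc}^2((\mathcal{R}_{h_{k+1}}y^{h_k},\mathcal{R}_{h_{k+1}}p^{h_k}),\mathcal{T}_{h_{k+1}})$
$\qquad \le \tilde\beta_*^2\big(\|(y^{h_k}-\mathcal{R}_{h_k}y^{h_k},p^{h_k}-\mathcal{R}_{h_k}p^{h_k})\|_a^2 + \tilde\gamma_*\mathrm{osc}^2((y_{h_k},p_{h_k}),\mathcal{T}_{h_k})\big)$

with

$\tilde\beta_* := \Big(\dfrac{(1+\delta_1)\beta_*^2 + \tilde C_5\kappa(h_0)}{1-\tilde C_5\delta_1^{-1}\kappa^2(h_0)}\Big)^{1/2}, \qquad \tilde\gamma_* := \dfrac{\gamma_*}{1-\tilde C_5\delta_1^{-1}\kappa^2(h_0)},$

where $\tilde C_5$ is some constant depending on constants that appear in the proof of Theorem 4.2, and
$\delta_1 \in (0,1)$ is chosen as in that proof.

Proof. The proof follows the similar procedure as in the proof of Theorem 4.2 when (4.5)-(4.6) are
replaced by (4.3)-(4.4). Specifically, in the proof of Theorem 4.2 we use (4.22), Theorem 3.3 and
Theorem 4.1 to prove (4.21). Conversely, here we need to prove (4.22) from (4.21), Theorem 3.3
and Theorem 4.1. □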


Next, we are able to derive a result similar to Lemma 1^751 concerning the optimality of Dorfler’s 
marking strategy for the optimal control problems. 


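Corollary 5.2. Let $(u_{h_k},y_{h_k},p_{h_k}) \in U_{ad} \times V_{h_k} \times V_{h_k}$ and $(u_{h_{k+1}},y_{h_{k+1}},p_{h_{k+1}}) \in U_{ad} \times V_{h_{k+1}} \times V_{h_{k+1}}$
be discrete solutions of the discrete optimal control problem over the mesh $\mathcal{T}_{h_k}$ and its refinement $\mathcal{T}_{h_{k+1}}$ with marked
elements $\mathcal{M}_{h_k}$. Suppose they satisfy the following property

$\|(y-y_{h_{k+1}},p-p_{h_{k+1}})\|_a^2 + \gamma_*\mathrm{osc}^2((y_{h_{k+1}},p_{h_{k+1}}),\mathcal{T}_{h_{k+1}}) \le \beta_*^2\big(\|(y-y_{h_k},p-p_{h_k})\|_a^2 + \gamma_*\mathrm{osc}^2((y_{h_k},p_{h_k}),\mathcal{T}_{h_k})\big)$

with constants $\gamma_* > 0$ and $\beta_* \in (0,\sqrt{1/2})$. Then the set $\mathcal{R}_{\mathcal{T}_{h_k}\to\mathcal{T}_{h_{k+1}}}$ of refined elements satisfies the
Dörfler property

(5.3) $\sum_{T\in\mathcal{R}_{\mathcal{T}_{h_k}\to\mathcal{T}_{h_{k+1}}}} \eta_{h_k}^2((y_{h_k},p_{h_k}),T) \ge \hat\theta \sum_{T\in\mathcal{T}_{h_k}} \eta_{h_k}^2((y_{h_k},p_{h_k}),T)$

with $\hat\theta = \dfrac{\tilde C_2(1-2\tilde\beta_*^2)}{\tilde C_0(\tilde C_1 + (1+2C_*^2\tilde C_1)\tilde\gamma_*)}$ and $\tilde C_0 = \max(1,\tilde C_3/\tilde\gamma_*)$.

Proof. From Lemma 5.1 we can conclude (5.2) from (5.1). Note that $y_{h_k} = \mathcal{R}_{h_k}y^{h_k}$ and $p_{h_k} = \mathcal{R}_{h_k}p^{h_k}$.
By the lower bounds in Lemma 3.5 we have

$(1-2\tilde\beta_*^2)\tilde C_2\,\eta_{h_k}^2((y_{h_k},p_{h_k}),\Omega) \le (1-2\tilde\beta_*^2)\big(\|(y^{h_k}-y_{h_k},p^{h_k}-p_{h_k})\|_a^2 + \tilde C_3\mathrm{osc}^2((y_{h_k},p_{h_k}),\mathcal{T}_{h_k})\big)$
$\qquad = (1-2\tilde\beta_*^2)\big(\|(y^{h_k}-y_{h_k},p^{h_k}-p_{h_k})\|_a^2 + \tfrac{\tilde C_3}{\tilde\gamma_*}\tilde\gamma_*\mathrm{osc}^2((y_{h_k},p_{h_k}),\mathcal{T}_{h_k})\big)$
$\qquad \le \tilde C_0(1-2\tilde\beta_*^2)\big(\|(y^{h_k}-y_{h_k},p^{h_k}-p_{h_k})\|_a^2 + \tilde\gamma_*\mathrm{osc}^2((y_{h_k},p_{h_k}),\mathcal{T}_{h_k})\big).$

Thus, it follows from (5.2) that

$\dfrac{\tilde C_2}{\tilde C_0}(1-2\tilde\beta_*^2)\sum_{T\in\mathcal{T}_{h_k}} \eta_{h_k}^2((y_{h_k},p_{h_k}),T)$
$\qquad \le (1-2\tilde\beta_*^2)\big(\|(y^{h_k}-y_{h_k},p^{h_k}-p_{h_k})\|_a^2 + \tilde\gamma_*\mathrm{osc}^2((y_{h_k},p_{h_k}),\mathcal{T}_{h_k})\big)$
$\qquad = \|(y^{h_k}-y_{h_k},p^{h_k}-p_{h_k})\|_a^2 + \tilde\gamma_*\mathrm{osc}^2((y_{h_k},p_{h_k}),\mathcal{T}_{h_k})$
$\qquad\quad - 2\tilde\beta_*^2\big(\|(y^{h_k}-y_{h_k},p^{h_k}-p_{h_k})\|_a^2 + \tilde\gamma_*\mathrm{osc}^2((y_{h_k},p_{h_k}),\mathcal{T}_{h_k})\big)$
$\qquad \le \|(y^{h_k}-y_{h_k},p^{h_k}-p_{h_k})\|_a^2 + \tilde\gamma_*\mathrm{osc}^2((y_{h_k},p_{h_k}),\mathcal{T}_{h_k})$
$\qquad\quad - 2\big(\|(y^{h_k}-\mathcal{R}_{h_{k+1}}y^{h_k},p^{h_k}-\mathcal{R}_{h_{k+1}}p^{h_k})\|_a^2 + \tilde\gamma_*\mathrm{osc}^2((\mathcal{R}_{h_{k+1}}y^{h_k},\mathcal{R}_{h_{k+1}}p^{h_k}),\mathcal{T}_{h_{k+1}})\big)$
$\qquad \le \|(y^{h_k}-y_{h_k},p^{h_k}-p_{h_k})\|_a^2 - \|(y^{h_k}-\mathcal{R}_{h_{k+1}}y^{h_k},p^{h_k}-\mathcal{R}_{h_{k+1}}p^{h_k})\|_a^2$
(5.4) $\qquad\quad + \tilde\gamma_*\big(\mathrm{osc}^2((y_{h_k},p_{h_k}),\mathcal{T}_{h_k}) - 2\,\mathrm{osc}^2((\mathcal{R}_{h_{k+1}}y^{h_k},\mathcal{R}_{h_{k+1}}p^{h_k}),\mathcal{T}_{h_{k+1}})\big).$

Note that $y_{h_k}$ and $\mathcal{R}_{h_{k+1}}y^{h_k}$ are the Galerkin projections of $y^{h_k}$ on $V_{h_k}$ and $V_{h_{k+1}}$, respectively.
From the standard Galerkin orthogonality we have

(5.5) $\|(y^{h_k}-y_{h_k},p^{h_k}-p_{h_k})\|_a^2 - \|(y^{h_k}-\mathcal{R}_{h_{k+1}}y^{h_k},p^{h_k}-\mathcal{R}_{h_{k+1}}p^{h_k})\|_a^2 = \|(y_{h_k}-\mathcal{R}_{h_{k+1}}y^{h_k},p_{h_k}-\mathcal{R}_{h_{k+1}}p^{h_k})\|_a^2.$

By (2.15), the triangle and the Young inequalities we have

$\mathrm{osc}^2((y_{h_k},p_{h_k}),T) \le 2\,\mathrm{osc}^2((\mathcal{R}_{h_{k+1}}y^{h_k},\mathcal{R}_{h_{k+1}}p^{h_k}),T) + 2C_*^2\|(y_{h_k}-\mathcal{R}_{h_{k+1}}y^{h_k},p_{h_k}-\mathcal{R}_{h_{k+1}}p^{h_k})\|_{a,T}^2,$

which together with the dominance of the indicator over oscillation (see [7, Remark 2.1])

(5.6) $\mathrm{osc}^2(u_{h_k}-Ly_{h_k},T) \le \eta_{y,h_k}^2(y_{h_k},T),$

(5.7) $\mathrm{osc}^2(y_{h_k}-y_d-L^*p_{h_k},T) \le \eta_{p,h_k}^2(p_{h_k},T)$

implies

$\mathrm{osc}^2((y_{h_k},p_{h_k}),\mathcal{T}_{h_k}) - 2\,\mathrm{osc}^2((\mathcal{R}_{h_{k+1}}y^{h_k},\mathcal{R}_{h_{k+1}}p^{h_k}),\mathcal{T}_{h_{k+1}})$
$\qquad \le \sum_{T\in\mathcal{R}_{\mathcal{T}_{h_k}\to\mathcal{T}_{h_{k+1}}}} \mathrm{osc}^2((y_{h_k},p_{h_k}),T) + \mathrm{osc}^2((y_{h_k},p_{h_k}),\mathcal{T}_{h_k}\cap\mathcal{T}_{h_{k+1}})$
$\qquad\quad - 2\,\mathrm{osc}^2((\mathcal{R}_{h_{k+1}}y^{h_k},\mathcal{R}_{h_{k+1}}p^{h_k}),\mathcal{T}_{h_k}\cap\mathcal{T}_{h_{k+1}})$
(5.8) $\qquad \le (1+2C_*^2\tilde C_1)\sum_{T\in\mathcal{R}_{\mathcal{T}_{h_k}\to\mathcal{T}_{h_{k+1}}}} \eta_{h_k}^2((y_{h_k},p_{h_k}),T),$

where we used (2.25) in the last inequality. Combining (5.4)-(5.8) and (2.25) we obtain

(5.9) $\dfrac{\tilde C_2}{\tilde C_0}(1-2\tilde\beta_*^2)\sum_{T\in\mathcal{T}_{h_k}} \eta_{h_k}^2((y_{h_k},p_{h_k}),T) \le \big(\tilde C_1 + (1+2C_*^2\tilde C_1)\tilde\gamma_*\big)\sum_{T\in\mathcal{R}_{\mathcal{T}_{h_k}\to\mathcal{T}_{h_{k+1}}}} \eta_{h_k}^2((y_{h_k},p_{h_k}),T).$

By choosing

$\hat\theta := \dfrac{\frac{\tilde C_2}{\tilde C_0}(1-2\tilde\beta_*^2)}{\tilde C_1 + (1+2C_*^2\tilde C_1)\tilde\gamma_*} = \dfrac{\tilde C_2(1-2\tilde\beta_*^2)}{\tilde C_0\big(\tilde C_1 + (1+2C_*^2\tilde C_1)\tilde\gamma_*\big)}$

we complete the proof. □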

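Lemma 5.3. Let $(y,p,y_d) \in \mathcal{A}^s$ and $\mathcal{T}_{h_k}$ ($k \ge 0$) be a sequence of meshes generated by Algorithm
3.8 starting from the initial mesh $\mathcal{T}_{h_0}$. Let $\mathcal{T}_{h_{k+1}} = \mathrm{REFINE}(\mathcal{T}_{h_k},\mathcal{M}_{h_k})$ where $\mathcal{M}_{h_k}$ is produced by
Algorithm 3.7 with $\theta$ satisfying $\theta \in \big(0, \frac{C_2\gamma}{C_3(C_1+(1+2C_*^2C_1)\gamma)}\big)$. Then

(5.10) $\#\mathcal{M}_{h_k} \le C_5\big(\|(y-y_{h_k},p-p_{h_k})\|_a^2 + \gamma\,\mathrm{osc}^2((y_{h_k},p_{h_k}),\mathcal{T}_{h_k})\big)^{-\frac{1}{2s}}\,|(y,p,y_d)|_s^{1/s},$

where the constant $C_5$ depends on the discrepancy between $\theta$ and $\frac{C_2\gamma}{C_3(C_1+(1+2C_*^2C_1)\gamma)}$.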


Proof. Let p,pi G (0,1) satisfy pi G (0,p) and 

a ^ _ ^27 _/I _ „2'i 

^ C3(Ci + (1 + 2C2Ci)7)^ 
Choose ^1 G (0,1) to satisfy (14.241) and 

( 5 . 11 ) {i+6,rpi^p^, 
which implies 

(5.12) (l + 5i)p2<l. 


Set 

e = -^Pi(\\iy - yhk.P - Phk)\\l + losc^{{yhk,Phk),Thk)) " 
and let be a refinement of Eho with minimal degrees of freedom satisfying 

(5-13) \\iy-yh,,p-Ph,)\\l + ii + ^)osc^{{yh,,PhJ,Th,) ^ 


We can conclude from the definition of that 

#Th, - #Tho < £~~‘\{y,p,yd)\l■ 


Let Th, '■= Th^ © Thk be the smallest common refinement of Tu^ and Th^- Let Vh^ C Hq{LI) and 
14 . C Hq{LI) be the finite element spaces defined on Th^ and Th,, respectively. Assume that 
{uh^,yhe,Phe) G Uad X Vh, X Vh^ is the solution of problem (IX7l)-(l^. 

Define y^^ := Suh^ and := S*{Sh^Uh^ — yd)- From the definition of oscillation we can 
conclude from Lemma \EfI\ that 

osc{uh, - LTZh^y^ATh,) < osc{uh, - LTZh.y^ATh.) + osc{L{Tih, - TlhJy^ATh.) 

< osc{uh, - LTZh.y^ATh.) + - 'R-h,)y''''\\a 

and 


osc{yh, - yd- L*nh,p ^'', Th,) 


^ osc{yh^ - yd- L*'R.h,p^'',%,) + osc{L* (TZh, - Th, )p^^,%,) 
^ osc{yh, - yd- L*nh,p^£Th,) + C^\\{nh, - 'R-h,)p’^'')\\a- 
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Then from the Young’s inequality we have 

osc^ {{nh,y^%'Rh,p^^),Th,) ^ 2osc^ ,'Rh,p'^'^),Th,) 

+2Clmnh, - (7^^. - 7^;.J/'))||2. 

Due to the orthogonality 

\\(,y^^ -nh,y^^,p^^ -Uh.p^^Wa = IK/" 

-||((7e^. - 7^/.J/^ {Uh. - 7^/.J/'))||^, 

we arrive at 

IK/' -7^,l./^/' -7^?,./')||^ + ^osc^((7^/,./^7^/,./'),T^,J 

^ IK/" -^/.e/">/" -^ft./")lla + ^OSc2((^/ie/">^?^e/")>7'/..)- 

From Theorem 12.61 we can see that 7 ^ 2 ^> implies 

IK/' -7^;,./^p'‘' -7^/^.p'*')||^+70sc^((7^/^./^7^ft,.p'‘'),71^J 
^ I Ky"" - y) 11 2 + ^osc2 (), r^.) 

< IK/" -^/«./",P^" -^?.eP'‘")lla + (7 + /osc2((7^/,^p'‘^7^/,^p^'),r?,.) 

with (7 = ^ — 76 ( 0 , 1 ). Following the similar procedure as in the proof of Theorem 14.21 when 
(14311-(IFHI) are replaced by (I4.3F(I4.4L we are led to 

IKy - y/i.,P-P/iJlla + 70 sc 2 ((y;i.,p,^.), 7 )i.) 

^ /3o (11 (y - y/i. , P - P/» J11 a + (7 + c^)osc^ ( (yft, , P/I J , 7 ; j) 

^ /3o(lKy- y/ie:P-p/*Jlla + (7 + i)osc2((y,i^,pftj,7)ij), 


(5.14) 

where 


/3o := ( 


(1 + (5i) -|- C/ihiho )N 2 


l-Ci5^^R‘^{ho) / 

and Ci is the constant appeared in the proof of Theorem 14. 2 1 Thus, by (15.131) and (|5.14l) it follows 

IKy - y?i..P -P?i.)lla + losc^{{yh,,Ph.),Th,) 

(5-15) ^ Pf{\\{y-yhk.p-Ph^)\\l+70sc^{{yh^,Ph^),Th^)^ 

with I3i = -^PoPi- 

In view of (|5.12l) we have (3f 6 (0, / provided hg <C 1. It follows from Corollary 15.21 that 


(5.16) 

where 9i = 


J2 vLiiyhk,Phk),T) ^ 0i J2 VhAiyhk,Phk),T), 

TeTh^ 


C 2 ( 1 - 2 P^) 


C5{Ci + {i+2CiCi)ji)^ 


71 = 


7 

■prr. 


C 5 = max(l, &) and 


/=( 


l-C^S^K^iho) ’ ■ “ 71 

(1 + ^i)Pi + Cbi^iho) \ 2 


l-C^S^^kHho) J 

It follows from the definition of 7 in (12.241) and 7 in (|4.32|1 that 71 < 1, which together with C 3 > 1 
(see [TT]) implies Cg = &. Since hg <C 1, we obtain that 71 > 7 and / £ (0, -^p) from (|5.1ip . It 


















is easy to see from (13.5011 and 71 > 7 that 

^ 2 ( 1 - 2 ^?) 


01 = 




C 2 


'Cl 
7i 

C2{l + C2k^(ho)) 


^(Ci + (1 + 2C2 Ci)7i) C3(^ + 1 + 2C2Ci) 


( 1 -p^) 


^3(1 + + 1 + C2Ci(l - C,k^{ho))) 


( 1 -P^) 


(5.17) 


> 


C 2 


C3(^ + 1 + 2 C 2 Ci) 


( 1 -P^) = 


C 27 


CsiCi + (1 + 2C'2Ci)7) 


(l-p 2 )> 0 , 


provided /iq ^ 1- This implies 


VhS^yhk.Phk),T)> e ^ vl^{{yhi:,PH),T). 






Note that Algorithm 13.71 selects a minimal set = Th^ satisfying 

yhS^Vhk,Phk)iT)^ 9 ^ ylS{yhk,Phk),T). 

T^Mhu T&Thi. 


Thus, 


*Mh, < < #%. - #%, ^ - #%, 


€ 


(■^pi) "{Uy-yhk^p-PH)\\l + iosc^iiyh,,,ph^),rhi:)^ "''l(y,p,yd)|i, 


which is the desired result with an explicit dependance on the discrepancy between 6 and Q^f^Q^j^^'^ 2 C‘^Ci)'y) ' 

We are now ready to prove that Algorithm 3.8 possesses optimal complexity for the state and
adjoint state approximations.

Theorem 5.4. Let $(u,y,p) \in U_{ad} \times H_0^1(\Omega) \times H_0^1(\Omega)$ be the solution of problem (3.1)-(3.2) and
$(u_{h_n},y_{h_n},p_{h_n}) \in U_{ad} \times V_{h_n} \times V_{h_n}$ be a sequence of solutions of the discrete problem corresponding
to a sequence of finite element spaces $V_{h_n}$ with partitions $\mathcal{T}_{h_n}$ produced by Algorithm 3.8. Then the
$n$-th iterate solution $(y_{h_n},p_{h_n})$ of Algorithm 3.8 satisfies the optimal bound

(5.18) $\|(y-y_{h_n},p-p_{h_n})\|_a^2 + \gamma\,\mathrm{osc}^2((y_{h_n},p_{h_n}),\mathcal{T}_{h_n}) \lesssim (\#\mathcal{T}_{h_n}-\#\mathcal{T}_{h_0})^{-2s},$

where the hidden constant depends on the exact solution $(u,y,p)$ and the discrepancy between $\theta$
and $\frac{C_2\gamma}{C_3(C_1+(1+2C_*^2C_1)\gamma)}$.

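Proof. It follows from (2.22) and (5.10) that

(5.19) $\#\mathcal{T}_{h_n}-\#\mathcal{T}_{h_0} \lesssim \sum_{k=0}^{n-1}\#\mathcal{M}_{h_k} \lesssim \sum_{k=0}^{n-1}\big(\|(y-y_{h_k},p-p_{h_k})\|_a^2 + \gamma\,\mathrm{osc}^2((y_{h_k},p_{h_k}),\mathcal{T}_{h_k})\big)^{-\frac{1}{2s}}\,|(y,p,y_d)|_s^{1/s}.$

From the lower bound (3.49) we have

$\|(y-y_{h_k},p-p_{h_k})\|_a^2 + \gamma\,\eta_{h_k}^2((y_{h_k},p_{h_k}),\Omega) \le C_6\big(\|(y-y_{h_k},p-p_{h_k})\|_a^2 + \gamma\,\mathrm{osc}^2((y_{h_k},p_{h_k}),\mathcal{T}_{h_k})\big),$

where $C_6 = \max(1+\frac{\gamma}{C_2},\frac{C_3}{C_2})$. Then we arrive at

(5.20) $\#\mathcal{T}_{h_n}-\#\mathcal{T}_{h_0} \lesssim \sum_{k=0}^{n-1}\big(\|(y-y_{h_k},p-p_{h_k})\|_a^2 + \gamma\,\eta_{h_k}^2((y_{h_k},p_{h_k}),\Omega)\big)^{-\frac{1}{2s}}\,|(y,p,y_d)|_s^{1/s}.$

Due to (4.19), we obtain for $0 \le k < n$ that

$\|(y-y_{h_n},p-p_{h_n})\|_a^2 + \gamma\,\eta_{h_n}^2((y_{h_n},p_{h_n}),\Omega) \le \beta^{2(n-k)}\big(\|(y-y_{h_k},p-p_{h_k})\|_a^2 + \gamma\,\eta_{h_k}^2((y_{h_k},p_{h_k}),\Omega)\big).$

Thus,

$\#\mathcal{T}_{h_n}-\#\mathcal{T}_{h_0} \lesssim \big(\|(y-y_{h_n},p-p_{h_n})\|_a^2 + \gamma\,\eta_{h_n}^2((y_{h_n},p_{h_n}),\Omega)\big)^{-\frac{1}{2s}}\,|(y,p,y_d)|_s^{1/s}\sum_{k=1}^{n}\beta^{k/s}$
(5.21) $\qquad \lesssim \big(\|(y-y_{h_n},p-p_{h_n})\|_a^2 + \gamma\,\eta_{h_n}^2((y_{h_n},p_{h_n}),\Omega)\big)^{-\frac{1}{2s}}\,|(y,p,y_d)|_s^{1/s},$

where the last inequality holds due to the fact that $\beta < 1$.

From (5.6)-(5.7) we have

$\mathrm{osc}^2((y_{h_n},p_{h_n}),\mathcal{T}_{h_n}) \le \eta_{h_n}^2((y_{h_n},p_{h_n}),\Omega),$

which together with (5.21) yields

(5.22) $\#\mathcal{T}_{h_n}-\#\mathcal{T}_{h_0} \lesssim \big(\|(y-y_{h_n},p-p_{h_n})\|_a^2 + \gamma\,\mathrm{osc}^2((y_{h_n},p_{h_n}),\mathcal{T}_{h_n})\big)^{-\frac{1}{2s}}\,|(y,p,y_d)|_s^{1/s}.$

This completes the proof. □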

Remark 5.5. From iS.35\) and the equivalence property iS.lS]) we can conclude that Theorem \4-S\ 
also implies the convergence of $\|u-u_{h_k}\|_{0,\Omega}$, namely, for the $n$-th iterate solution $u_{h_n}$ of Algorithm
3.8 there holds

(5.23) $\|u-u_{h_n}\|_{0,\Omega} \lesssim \beta^n.$

We remark that the control variable can also be included into the complexity analysis of AFEM for
optimal control problems to obtain

(5.24) $\|u-u_{h_n}\|_{0,\Omega} \lesssim (\#\mathcal{T}_{h_n}-\#\mathcal{T}_{h_0})^{-s}.$

However, the above results are sub-optimal for the optimal control as illustrated by the numerical
results in Section 6. To prove the optimality of AFEM for the control variable it seems that we need
to work with AFEM based on $L^2$-norm error estimators; we refer to [20] for optimal a priori error
estimate. We expect that the results in m will enable us to prove the optimal convergence of 
AFEM for the optimal control u, this will be postponed to future work. 


6. Numerical experiments 

In this section we carry out some numerical tests in two dimensions to support our theoretical 
results obtained in this paper. We take the elliptic operator L as —A with homogeneous Dirichlet 
boundary condition for all the examples. 

Example 6.1. This example is taken from [I]. The domain D can be described in polar coordinates 
by 

Tl = {(r,"d), l<r<l, 0<'d< ^tt}. 

We take the exact solutions as 

y{r,-d) = (r"^ — r*"^) sin(A-d), 

p{r,'d) = a(r'^ — r*"^) sin(Ad), 

u{r,d) = PuA--) 
a 

with A = I and pi = ^'2 = |. We set a = 0.1, a = —0.3 and 6=1. We assume the additional 
right hand side $f$ for the state equation.

We give the numerical results for the optimal control approximation by Algorithm 3.8 with
parameter $\theta = 0.4$ and $\theta = 0.5$. Figure 1 shows the profiles of the numerically computed optimal
state and adjoint state. We present in Figure 2 the triangulations generated by Algorithm 3.8 after 8 and 10
adaptive iterations. We can see that the meshes are concentrated around the reentrant corner where
the singularities are located.
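In Example 6.1 the optimal control is obtained from the adjoint state by a pointwise projection onto the admissible interval, which is also how the variational discretization recovers the discrete control from the discrete adjoint state. A minimal Python sketch of this projection follows. It assumes the usual relation in which the control equals the projection of the negative adjoint state divided by the regularization parameter; the function names and the sample adjoint values are hypothetical, while the parameter values 0.1, -0.3 and 1 are those of Example 6.1.

```python
from typing import List, Sequence


def project(value: float, lower: float, upper: float) -> float:
    """Pointwise projection of a real number onto the interval [lower, upper]."""
    return min(max(value, lower), upper)


def control_from_adjoint(p_values: Sequence[float], alpha: float,
                         lower: float, upper: float) -> List[float]:
    """Evaluate u = P_[a,b](-p / alpha) at each sampled adjoint value."""
    return [project(-p / alpha, lower, upper) for p in p_values]


# Hypothetical adjoint values at a few sample points; alpha, a and b as in Example 6.1.
p_samples = [-0.2, -0.05, 0.02, 0.1]
u_samples = control_from_adjoint(p_samples, alpha=0.1, lower=-0.3, upper=1.0)
print(u_samples)
```

The first and last samples end up on the upper and lower bound, respectively, so the control is active there; the kinks at the boundary of the active set are why the control is not represented in a finite element space in the variational approach.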



Figure 1. The profiles of the discretised optimal state $y_h$ (left) and adjoint state
$p_h$ (right) for Example 6.1 on adaptively refined mesh.

Figure 2. The meshes after 8 (left) and 10 (right) adaptive iterations for Example
6.1 generated by Algorithm 3.8 with $\theta = 0.4$.

To illustrate the efficiency of the adaptive finite element method for solving optimal control
problems, we show in the left plot of Figure 3 the error histories of the optimal control, state and
adjoint state with uniform refinement. We can only observe reduced orders of convergence,
which are less than one for the energy norms of the state and adjoint state, and less than two for
the $L^2$-norm of the control. In the right plot of Figure 3 we present the convergence behaviours
of the optimal control, state and adjoint state, as well as the error estimators $\eta_{y,h}(y_h,\Omega)$ and
$\eta_{p,h}(p_h,\Omega)$ for the state and adjoint state equations with adaptive refinement. In Figure 4 we
present the convergence of the error $\|(y-y_h,p-p_h)\|_a$ and error indicator $\eta_h((y_h,p_h),\Omega)$ with
$\theta = 0.4$ and $\theta = 0.5$, respectively. Figure 4 shows that the error $\|(y-y_h,p-p_h)\|_a$ is
proportional to the a posteriori error estimators, which implies the efficiency of the a posteriori
error estimators given in Section 3. Moreover, we can also observe that the convergence curve of the
error $\|(y-y_h,p-p_h)\|_a$ is approximately parallel to the line with slope $-1/2$, which is the optimal
convergence rate we can expect by using linear finite elements; this coincides with our theory in
Section 5. For the error $\|u-u_h\|_{0,\Omega}$ we can observe the reduction with slope $-1$, which is better
than the results presented in Remark 5.5 and strongly suggests that the convergence rate for the
optimal control stated there is not optimal.
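The slopes quoted above are read off from log-log plots of the error against the number of elements. A least-squares fit of such a slope can be sketched as follows in Python; the function name `loglog_slope` is hypothetical and the data are synthetic, generated to decay exactly like the inverse square root of the number of elements, so they are not the measurements of this paper.

```python
import math
from typing import Sequence


def loglog_slope(n_elements: Sequence[float], errors: Sequence[float]) -> float:
    """Least-squares slope of log(error) against log(number of elements)."""
    xs = [math.log(n) for n in n_elements]
    ys = [math.log(e) for e in errors]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx


# Synthetic data (not from the paper): error = 3 * N^(-1/2).
n_elements = [100, 200, 400, 800, 1600]
errors = [3.0 * n ** -0.5 for n in n_elements]
print(round(loglog_slope(n_elements, errors), 6))
```

Applied to the energy errors of the state and adjoint state on adaptively refined meshes, a fitted slope close to -1/2 would correspond to the optimal rate for linear elements in two dimensions.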


[Plots: "Uniform refinement" (left) and "Error reductions" (right).]

Figure 3. The convergence history of the optimal control, state and adjoint state
on uniformly refined meshes (left), and the convergence of the errors and estimators
on adaptively refined meshes (right) for Example 6.1 generated by Algorithm 3.8.

Example 6.2. In the second example we consider an optimal control problem without explicit 
solutions, we set fl = (—1,1)^, a = 10“^, a = —10 and b = 10. The desired state yd is chosen as 
10 , 1, —10 and —1 in the first, second, third and fourth quadrant, respectively. 

Similar to the above example, Figure 5 shows the profiles of the numerically computed optimal
state and adjoint state. We present in the left plot of Figure 6 the triangulation generated by
Algorithm 3.8 after 8 adaptive iterations with parameter $\theta = 0.5$. Since there are no explicit
solutions we cannot show the convergence of the error $\|(y-y_h,p-p_h)\|_a$ as in Example 6.1.
Instead we show in the right plot of Figure 6 the convergence of the error indicator $\eta_h((y_h,p_h),\Omega)$
and of the error estimators $\eta_{y,h}(y_h,\Omega)$ and $\eta_{p,h}(p_h,\Omega)$ for the state and adjoint state equations. We can
observe the error reduction with slope $-1/2$.

Example 6.3. In the third example we also consider an optimal control problem without explicit 
solutions defined on domain = (—1,1) x (—1,1)\[0,1) x (a;i,0]. We set a = 10~^, a = 0 and 
6 = 8. We take the desired state yd = ‘2. 

We show in Figure 7 the profiles of the numerically computed optimal state and adjoint state;
singularities for both the state and adjoint state can be observed around the reentrant corner. We
present in the left plot of Figure 8 the triangulation generated by Algorithm 3.8 after 8 adaptive
iterations with parameter $\theta = 0.5$, which is locally refined around the corner. Since there are no
explicit solutions we also show in the right plot of Figure 8 the convergence of the error indicator
$\eta_h((y_h,p_h),\Omega)$ and of the error estimators $\eta_{y,h}(y_h,\Omega)$ and $\eta_{p,h}(p_h,\Omega)$ for the state and adjoint state
equations. We can also observe the error reduction with slope $-1/2$.

[Plots: "Error reductions" (left) and "Error reductions" (right).]

Figure 4. The convergence history of the optimal control, the state and adjoint
state and error indicator on adaptively refined meshes with $\theta = 0.4$ (left) and
$\theta = 0.5$ (right) for Example 6.1 generated by Algorithm 3.8.

Figure 5. The profiles of the discretised optimal state $y_h$ (left) and adjoint state
$p_h$ (right) for Example 6.2 on adaptively refined mesh.

7. Conclusion and outlook 

In this paper we give a rigorous convergence analysis of the adaptive finite element algorithm
for optimal control problems governed by linear elliptic equations. We prove that the AFEM is a
contraction, for the sum of the energy errors and the scaled error estimators of the state $y$ and the
adjoint state $p$, between two consecutive adaptive loops. We also show that the AFEM yields a
decay rate of the energy errors of the state $y$ and the adjoint state $p$ plus oscillations of the state
and adjoint state equations in terms of the number of degrees of freedom.

Figure 6. The mesh (left) after 8 adaptive iterations and the convergence history
of the error estimators on adaptively refined meshes (right) with $\theta = 0.5$ for
Example 6.2 generated by Algorithm 3.8.

Figure 7. The profiles of the discretised optimal state $y_h$ (left) and adjoint state
$p_h$ (right) for Example 6.3 on adaptively refined mesh.

We expect that the results should also be valid for optimal Neumann boundary control
problems (see [27]) by the following observations. The key point for the convergence analysis is the
equivalence properties presented in Theorem 3.3, where the relation between the finite element
optimal control approximation and the standard finite element boundary value approximation is
established. Consider the governing equation of the Neumann boundary control problem:

$Ly = f$ in $\Omega$, $\qquad A\nabla y\cdot n = u$ on $\partial\Omega$.

Similar to the proof of Theorem 3.3 we can conclude from the trace theorem that

$\|u-u_h\|_{0,\partial\Omega} \lesssim \kappa^{1/2}(h)\big(\|y-y_h\|_{a,\Omega} + \|p-p_h\|_{a,\Omega}\big),$

where $u_h$ is the discrete optimal control. Then we can obtain the counterpart of (3.20)-(3.21) for
Neumann boundary control problems

$\|y-y_h\|_{a,\Omega} = \|y^h-y_h\|_{a,\Omega} + O(\kappa^{1/2}(h))\big(\|y-y_h\|_{a,\Omega} + \|p-p_h\|_{a,\Omega}\big),$

$\|p-p_h\|_{a,\Omega} = \|p^h-p_h\|_{a,\Omega} + O(\kappa^{1/2}(h))\big(\|y-y_h\|_{a,\Omega} + \|p-p_h\|_{a,\Omega}\big)$

provided $h_0 \ll 1$. Thus, the convergence and complexity analysis of AFEM carries over to the
Neumann boundary control problems.

[Plots: "Mesh after 10 iterations" (left) and "Error reductions" against "Number of elements" (right).]

Figure 8. The mesh (left) after 10 adaptive iterations and the convergence history
of the error estimators on adaptively refined meshes (right) with $\theta = 0.4$ for
Example 6.3 generated by Algorithm 3.8.

Many important issues remain unsolved in the convergence analysis of AFEM for
optimal control problems, compared to AFEM for boundary value problems. Firstly, at this moment
we only prove the optimality of AFEM for the energy errors of the state and adjoint state variables,
whereas the convergence for the optimal control $u$ is sub-optimal. To prove the optimality of AFEM for
the optimal control $u$ it seems that we should work on the optimality of AFEM for boundary
value problems under L^-norms, as done in m- This complicates the convergence analysis with 
additional restrictions to the adaptive algorithms and will be postponed to future work. 

Secondly, the convergence analysis of the adaptive finite element algorithm for other kinds of
optimal control problems, like Stokes control problems (see [28]), and of non-standard finite element
algorithms such as mixed finite element methods (see [8]) remains open and will be addressed in
forthcoming papers.

Thirdly, we only prove the convergence of AFEM for optimal control problems with control
constraints by using variational control discretization. The full control discretization concept
by using piecewise constant or piecewise linear finite elements is also very important among the
numerical methods for control problems. This kind of control discretization results in an additional
discretised control space and an additional contribution to the a posteriori error estimators (see [22]),
which should be incorporated within the adaptive algorithm and the corresponding convergence
analysis. We also intend to generalise our approach in this paper to analyse the convergence of
AFEM for optimal control problems with full control discretization in the future.